Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,333 datasets
An employer survey addresses skill shortages in the Indian economy by assessing the importance of, satisfaction with, and gaps in skills among newly graduated engineers. The survey, authored by Andreas Blom, finds 64% of employers are only somewhat satisfied or worse with new hires and identifies significant gaps in higher-order thinking skills. Results suggest engineering education should refocus on soft skills and higher-order cognitive abilities.
Deep learning-based structural models for the entire proteome of the Treponema pallidum Nichols strain provide insights into syphilis pathogenesis. The dataset, created by Simon Houston of the Cameron Lab and last updated in April 2026, likely contains predictions for outer-membrane proteins, pathogenesis-related proteins, and B-cell epitopes. This resource is intended for computational analysis to support vaccine and therapeutic development.
Hire Mind Resume dataset is a text dataset for recruitment AI and machine learning. The dataset is hosted on Kaggle and is tagged for topics including human resources and the job market. Specific details on size, structure, and provenance are not provided in the available metadata.
Geoscience Australia Data provides predictions of sediment threshold exceedance across the Australian continental shelf. The analysis uses estimates of significant wave height and period, combined with tidal current speeds over a semi-lunar cycle, to map areas where unconsolidated sediment is mobilized. The relative importance of tidal currents and swell waves as sediment-entraining processes is quantified independently.
Fire Risk Assessments for all residential blocks above 10 storeys in the London Borough of Camden, published following a 2017 public commitment by the Council Leader. Each technical assessment includes a summary and a record of subsequent actions taken by the Council. The data is available in multiple structured formats including CSV, JSON, and XML.
Reconstruction results generated for the paper 'Large-scale integrated optoelectronic chaos for machine learning acceleration'. The dataset, authored by Zhouyang Pan, is 48.9 MB in size and was last updated on March 22, 2026. It is available in TXT, CSV, and MAT file formats under a CC-BY-4.0 license.
A sample dataset designed for training a crossencoder reranker using exhaustive pairwise learning. The dataset originates from the MS-MARCO platform, a standard benchmark for machine reading comprehension and information retrieval. Its specific size, author, and last update date are not provided.
Kaggle hosts a dataset concerning energy usage in educational buildings. The dataset likely contains measurements of energy consumption over time. Metadata is minimal; actual content requires verification after download.
Jeffrey Livingston from Peking University conducted an experiment measuring the effect of extrinsic incentives on student test performance. The study compares U.S. and Shanghai students, finding incentives improved performance for U.S. students but not for Shanghai students. The dataset likely contains experimental results supporting the analysis that low-stakes international rankings may reflect motivation differences, not just ability.
Compliance Year 2020 results from lead testing in drinking water mandated for all New York State public schools and BOCES. The dataset was reported by each school district to the NYS Department of Health and includes sampling dates, outlet counts, and remediation status. Data is provided by health.data.ny.gov and was last updated on the platform in January 2026.
Global horizontal solar radiation in kWh/m2/day for one year organized into cells with 10km x 10km resolution. The data was produced by INPE and LABSOLAR, who assessed the reliability of the BRASIL-SR model by comparing its estimates with ground measurements from solarimetric stations across Brazil.
Central Colorado streams were sampled for water chemistry and aquatic fauna in 2004 and 2005 by the U.S. Geological Survey's Central Colorado Assessment Project. The dataset includes selected field parameters and analytical results from water and macroinvertebrate samples. The project aims to develop statistical and GIS models to quantify relationships between ecological indicators of metal contamination and landscape characteristics.
Thirty Bioeffects Assessments have been conducted as part of the National Status and Trends program. This dataset contains planned and actual sampling location information for sites, beginning with the St. Lucie Estuary Study from 2001. It is compiled by NOAA's National Centers for Coastal Ocean Science.
Records of student innovation evaluations intended for labor education analysis. The dataset is hosted on Kaggle, but specific details about its creation date, author, and temporal coverage are not provided. Its size, structure, and specific variables are unknown from the available metadata.
57 Study Units within the conterminous United States are represented by this GIS coverage from the U.S. Geological Survey's National Water-Quality Assessment (NAWQA) Program. It contains boundaries, names, starting dates, and standard abbreviations for each unit, compiled from hydrologic and state boundary sources. The data was developed for a program initiated in fiscal year 1991 to assess national water resources.
ESA WorldCover 2021 provides a global map of 11 land cover classes, such as trees and cropland. The dataset offers worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. It was produced by a consortium led by VITO and partners for the reference year 2021.
A global land cover map with 11 distinct land cover classes. The dataset provides worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. It was produced for the reference year 2021 by a consortium led by VITO and includes partners like Brockmann Consult and Wageningen University.
2020 global land cover map provides 11 distinct land cover classes. It offers worldwide coverage at a 10-meter spatial resolution, organized into 3 x 3 degree tiles. The product was created by a consortium led by VITO, including Brockmann Consult, CS SI, Gamma Remote Sensing, IIASA, and Wageningen University.
ESA WorldCover provides a global land classification map with 11 distinct land cover classes. The data offers worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. This product was created by a consortium led by VITO for the reference year 2020.
California's 149 river basins are covered by a geospatial data management system aggregating over 60 data layers per basin, including vegetation, land ownership, dams, and water quality. The system, developed by SCIOPS, was designed to provide a unified resource for river conservation decisions. Initial development integrated data for 13 demonstration basins, with plans to cover all basins by June 1998.