DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Education Datasets | DataSalon

All Categories

🎓

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,333 datasets

Education

Employability and Skill Set of Newly Graduated Engineers in India: Employer Survey

An employer survey addresses skill shortages in the Indian economy by assessing the importance of, satisfaction with, and gaps in skills among newly graduated engineers. The survey, authored by Andreas Blom, finds 64% of employers are only somewhat satisfied or worse with new hires and identifies significant gaps in higher-order thinking skills. Results suggest engineering education should refocus on soft skills and higher-order cognitive abilities.

Tabular🇮🇳 IndiaPedagogyEmployabilityProgramming LanguageSet Abstract Data TypeEngineeringComputer ScienceSurveyPsychologyFinanceEngineering EducationMathematics EducationEngineering ManagementSkill Gap+1

2 views

Education

Treponema pallidum Proteome Structural Models for Pathogen and Vaccine Research

Deep learning-based structural models for the entire proteome of the Treponema pallidum Nichols strain provide insights into syphilis pathogenesis. The dataset, created by Simon Houston of the Cameron Lab and last updated in April 2026, likely contains predictions for outer-membrane proteins, pathogenesis-related proteins, and B-cell epitopes. This resource is intended for computational analysis to support vaccine and therapeutic development.

TabularSyphilis ResearchVaccine DevelopmentStructural BiologyDeep Learning+1

0 views

Education

Hire Mind Resume Dataset for Recruitment AI and Machine Learning

Hire Mind Resume dataset is a text dataset for recruitment AI and machine learning. The dataset is hosted on Kaggle and is tagged for topics including human resources and the job market. Specific details on size, structure, and provenance are not provided in the available metadata.

TextTabularRecruitment AiHuman ResourcesResume DataJob Market+1

0 views

Education

Australian Continental Shelf Sediment Mobility Predictions from Tidal and Wave Data

Geoscience Australia Data provides predictions of sediment threshold exceedance across the Australian continental shelf. The analysis uses estimates of significant wave height and period, combined with tidal current speeds over a semi-lunar cycle, to map areas where unconsolidated sediment is mobilized. The relative importance of tidal currents and swell waves as sediment-entraining processes is quantified independently.

Time SeriesGeospatial🇦🇺 AustraliaSediment TransportOcean WavesTidal CurrentsContinental Shelf+1

0 views

Education

Camden Fire Risk Assessments for High-Rise Buildings

Fire Risk Assessments for all residential blocks above 10 storeys in the London Borough of Camden, published following a 2017 public commitment by the Council Leader. Each technical assessment includes a summary and a record of subsequent actions taken by the Council. The data is available in multiple structured formats including CSV, JSON, and XML.

TabularCSVXMLJSONFireHousingFire RiskPublic TransparencyBuilding AssessmentsHousing Safety+1

0 views

Education

Reconstruction Results for Optoelectronic Chaos in Machine Learning Acceleration

Reconstruction results generated for the paper 'Large-scale integrated optoelectronic chaos for machine learning acceleration'. The dataset, authored by Zhouyang Pan, is 48.9 MB in size and was last updated on March 22, 2026. It is available in TXT, CSV, and MAT file formats under a CC-BY-4.0 license.

TabularCSVTextMachine LearningComputational PhysicsOptoelectronicsOptical Computing FieldLarge ScaleChaosSynthetic+1

0 views

Education

MS-MARCO-RerankerSample: Training Data for Crossencoder Rerankers

A sample dataset designed for training a crossencoder reranker using exhaustive pairwise learning. The dataset originates from the MS-MARCO platform, a standard benchmark for machine reading comprehension and information retrieval. Its specific size, author, and last update date are not provided.

TextMachine LearningRerankingNatural Language ProcessingInformation Retrieval+1

0 views

Education

Educational Building Energy Consumption Data Set

Kaggle hosts a dataset concerning energy usage in educational buildings. The dataset likely contains measurements of energy consumption over time. Metadata is minimal; actual content requires verification after download.

TabularEnergy ConsumptionEducationBuilding Management+1

0 views

Education

Experimental Data on Student Test Effort Across U.S. and Shanghai Cultures

Jeffrey Livingston from Peking University conducted an experiment measuring the effect of extrinsic incentives on student test performance. The study compares U.S. and Shanghai students, finding incentives improved performance for U.S. students but not for Shanghai students. The dataset likely contains experimental results supporting the analysis that low-stakes international rankings may reflect motivation differences, not just ability.

TabularEcologyComputer SciencePsychologyStudent AssessmentBiologyEducationTest BiologyExperimental DataCross Cultural+1

0 views

Education

Lead Testing in New York State School Drinking Water, 2020 Compliance Year

Compliance Year 2020 results from lead testing in drinking water mandated for all New York State public schools and BOCES. The dataset was reported by each school district to the NYS Department of Health and includes sampling dates, outlet counts, and remediation status. Data is provided by health.data.ny.gov and was last updated on the platform in January 2026.

TabularGeospatialCSVXMLJSONDrinking WaterSchoolsNew York StateHealthcareLead TestingPublic Health+1

0 views

Education

Brazil Solar Radiation Estimates at 10km Resolution

Global horizontal solar radiation in kWh/m2/day for one year organized into cells with 10km x 10km resolution. The data was produced by INPE and LABSOLAR, who assessed the reliability of the BRASIL-SR model by comparing its estimates with ground measurements from solarimetric stations across Brazil.

Geospatial🇧🇷 BrazilClimateBenchmarkSolar Energy+1

0 views

Education

USGS CCAP: Water and Macroinvertebrate Samples from Central Colorado Streams, 2004-2005

Central Colorado streams were sampled for water chemistry and aquatic fauna in 2004 and 2005 by the U.S. Geological Survey's Central Colorado Assessment Project. The dataset includes selected field parameters and analytical results from water and macroinvertebrate samples. The project aims to develop statistical and GIS models to quantify relationships between ecological indicators of metal contamination and landscape characteristics.

TabularGeospatialMining ImpactMacroinvertebratesWater QualityEnvironmental AssessmentGeospatial Analysis+1

0 views

Education

NOAA Coastal Bioeffects Assessment Site Locations from 1986

Thirty Bioeffects Assessments have been conducted as part of the National Status and Trends program. This dataset contains planned and actual sampling location information for sites, beginning with the St. Lucie Estuary Study from 2001. It is compiled by NOAA's National Centers for Coastal Ocean Science.

TabularGeospatialNoaaSite LocationEnvironmental AssessmentCoastal MonitoringMarine Biology+1

0 views

Education

Student Innovation Evaluation Records for Labor Education Analysis

Records of student innovation evaluations intended for labor education analysis. The dataset is hosted on Kaggle, but specific details about its creation date, author, and temporal coverage are not provided. Its size, structure, and specific variables are unknown from the available metadata.

TabularStudent InnovationEducation AssessmentCompetence EvaluationLabor EducationBenchmark+1

0 views

Education

NAWQA Study Unit Boundaries for National Water-Quality Assessment

57 Study Units within the conterminous United States are represented by this GIS coverage from the U.S. Geological Survey's National Water-Quality Assessment (NAWQA) Program. It contains boundaries, names, starting dates, and standard abbreviations for each unit, compiled from hydrologic and state boundary sources. The data was developed for a program initiated in fiscal year 1991 to assess national water resources.

AudioGeospatialEnvironmental monitoringHydrologyGeospatial BoundariesWater QualitySynthetic+1

0 views

Education

Global Land Cover Map at 10-Meter Resolution

ESA WorldCover 2021 provides a global map of 11 land cover classes, such as trees and cropland. The dataset offers worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. It was produced by a consortium led by VITO and partners for the reference year 2021.

ImageGeospatialEnvironmental monitoringLand Cover+1

0 views

Education

ESA Global Land Cover Map at 10-Meter Resolution

A global land cover map with 11 distinct land cover classes. The dataset provides worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. It was produced for the reference year 2021 by a consortium led by VITO and includes partners like Brockmann Consult and Wageningen University.

GeospatialEnvironmental monitoringLand CoverGeospatial Analysis+1

0 views

Education

Global 10-Meter Resolution Land Cover Map for 2020

2020 global land cover map provides 11 distinct land cover classes. It offers worldwide coverage at a 10-meter spatial resolution, organized into 3 x 3 degree tiles. The product was created by a consortium led by VITO, including Brockmann Consult, CS SI, Gamma Remote Sensing, IIASA, and Wageningen University.

ImageGeospatialEnvironmental monitoringLand Cover+1

0 views

Education

Global Land Cover Map At 10-Meter Resolution

ESA WorldCover provides a global land classification map with 11 distinct land cover classes. The data offers worldwide coverage at a 10-meter spatial resolution, organized into 3x3 degree tiles. This product was created by a consortium led by VITO for the reference year 2020.

GeospatialEnvironmental monitoringGlobal CoverageLand Cover+1

0 views

Education

California River Basins Environmental Data and GIS Layers

California's 149 river basins are covered by a geospatial data management system aggregating over 60 data layers per basin, including vegetation, land ownership, dams, and water quality. The system, developed by SCIOPS, was designed to provide a unified resource for river conservation decisions. Initial development integrated data for 13 demonstration basins, with plans to cover all basins by June 1998.

TabularAudioGeospatialHydrologyCaliforniaRiver Assessment+1

0 views

PreviousPage 308 of 665Next