DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Education Datasets | DataSalon

All Categories

🎓

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,313 datasets

Education

Telugu STEM Textbook Corpus for NLP and AI Model Development

A large-scale collection of Telugu STEM textbook data created by InfoBayAI and last updated on April 10, 2026. The dataset is designed to support the development of advanced NLP systems and AI models for scientific understanding and problem-solving in Telugu.

TextMultilingualTelugu LanguageStem EducationLarge ScaleNatural Language ProcessingMultilingual NlpText Corpus+1

0 views

Education

AVHRR Level 1b Inventory: Satellite Data for Sea and Cloud Temperature

NOAA/TIROS-N AVHRR sensors measure visible, near-infrared, and infrared radiation across 4 or 5 spectral bands with a ground resolution of about 1.1 km at nadir. The Level 1b data product contains quality-controlled raw data with appended sensor calibration and earth location information, stored on 6250bpi tapes from NESDIS/SDSD and the University of Miami. Data is collected via HRPT (High Resolution Picture Transmission) and LAC (Local Area Coverage) modes from satellites including NOAA-6, 7, 8, 9, 10, and TirosN.

GeospatialAvhrrSatellite ImagerySea Surface TemperatureCloud Top Temperature+1

0 views

Education

Geological Map and Rock Samples from Fosdick Mountains, Antarctica

Marie Byrd Land in West Antarctica is covered by a 16,000 km² GIS-based 3-D geological map consolidating bedrock geology, airborne geophysics, structural data, and geochronology. The resource includes rock sample collections, predominantly migmatite gneisses and plutonic rocks, registered with IGSN numbers. The database is intended for publication as a dynamic GIS by the Antarctic Geospatial Information Center at the University of Minnesota.

TabularGeospatialMultimodalAntarcticaGeochronologyGeology+1

0 views

Education

Kenya Coastal Zone Boundaries and School Locations

Kenya's coastal zone is mapped in a 1:250,000-scale vector database containing international and administrative boundaries, along with school locations. The database was developed under the Eastern African Action Plan by UNEP/GRID-PAC, UNEP/OCA-PAC, and KEMFRI, integrating data from the Survey of Kenya, Landsat imagery, and socio-economic sources. Feature names and attributes are stored for points, lines, and polygons.

GeospatialSchoolsCoastal managementKENYAAdministrative BoundariesFinance+1

0 views

Education

Global Adult Illiteracy Rates from 2005 to 2024

2005-2024 is the stated temporal coverage for this dataset of adult illiteracy rates. It likely contains country-level or regional statistics on literacy, a key socioeconomic indicator. The dataset is published on Kaggle, but its original source and compilation method are unknown.

TabularTime SeriesLiteracyEducationGlobal HealthSocioeconomic Indicators+1

0 views

Education

MAGIC-IRRI: Synthetic Plant Genetics Data for Trait Prediction

MAGIC populations are ideal for learning complex models due to their high genetic recombination, diversity, and large sample size. This synthetic dataset of 2000 observations was generated from a Bayesian network model developed for a talk on multiple trait prediction in plant genetics. The model and data were created by Marco Scutari, Phil Howell, David J Balding, and Ian Mackay.

TabularMachine LearningBayesian NetworksAgriculturePlant GeneticsSensor DataSynthetic DataIrrigationMagic PopulationSynthetic+1

0 views

Education

Paper Conclusion RL Training: Qwen3-VL-8B-Thinking with External Judge

Paper Conclusion RL Training is a dataset for reinforcement learning training based on the EasyR1 (verl) framework. The training model is Qwen3-VL-8B-Thinking, using an external judge model (Qwen3-4B-Instruct-2507) to score predicted conclusions against a 235B teacher model's reference conclusions. The dataset was authored by SII-ChengqiLi and last updated on 2026-04-10.

TextAcademic TextModel TrainingReinforcement LearningLarge Language Models+1

0 views

Education

Great Barrier Reef Conference Papers, August-September 1983

A collection of papers published for the inaugural Great Barrier Reef Conference held at James Cook University in Townsville. The dataset is provided by Geoscience Australia and was last updated on the platform in April 2026. The legacy product has no abstract available, and the specific content and structure require verification after download.

Text🇦🇺 AustraliaCoral ReefMarine ScienceConference ProceedingsGeoscience+1

0 views

Education

Madonna University Application and Registration Form Data

A dataset containing application and registration form information for Madonna University. The data likely includes details submitted by prospective students for the 2026/2027 academic year. The specific fields, volume, and completeness are not detailed in the available metadata.

TabularApplication FormsUniversity AdmissionsEducation Data+1

0 views

Education

Igbinedion University Okada 2026/2027 Application and Registration Form

Igbinedion University Okada 2026/2027 application and registration form data is hosted on Kaggle. The dataset likely contains information submitted by prospective students for the 2026/2027 academic session. Its author, organization, and specific data structure are unknown.

TabularApplication FormsUniversity AdmissionsNigeriaHigher Education+1

0 views

Education

Babcock University Application and Registration Form Data

Babcock University, Ilishan-Remo, has released its application and registration form for the 2026/2027 academic session. The dataset likely contains information submitted by prospective students during the university's admissions process. Specific details on data volume, structure, and collection method are not provided in the available metadata.

TabularNigeria EducationUniversity AdmissionsStudent Registration+1

0 views

Education

Ethiopia Literacy and Education Levels from 2016 DHS Survey

Ethiopia's literacy and education levels derived from the 2016 Standard Demographic Health Survey (DHS). The data is provided by the Central Statistical Agency (CSA) of Ethiopia and published by the IGAD Climate Prediction and Applications Center (ICPAC). It is available as a GEOTIFF raster file with a spatial resolution of 0.05 pixels.

GeospatialGeodataLiteracy RatesHealthcareEducationDemographic Health SurveyEthiopia+1

0 views

Education

U.S. Social Security Administration Employee Hires and Losses Tracking

Social Security Administration data tracks employee hires, losses, and internal transfers within its Office of Systems. The database includes information on employee skills, job assignments, and transfer approvals. It was last updated on April 3, 2026.

TabularEmployee TurnoverTransferEmployee SkillsGovernment WorkforceLossSkills InventoryStaff TransferHuman ResourcesEmployee AssignmentHire+1

0 views

Education

FIRE Arctic Cloud Experiment: Tethered Balloon Data for Climate Model Validation

First ISCCP Regional Experiment data from a tethered balloon campaign designed to improve cloud and radiation parameterizations in climate models. The dataset, provided by the National Aeronautics and Space Administration, includes files in BIN, TAR, ISO, HTML, and PDF formats. It focuses on the life cycles and radiative properties of Arctic clouds to validate satellite data and general circulation models.

GeospatialMultimodalEarth Science Atmospheric Winds Atmosphere SurfaceArctic ResearchAtmospheric ScienceEarth Science Atmospheric Pressure AtmosphereSatellite DataCloud PhysicsClimate ModelingEarth Science Atmospheric Temperature Atmosphere S+1

0 views

Education

FIRE Arctic Cloud Experiment: Tower-Based Atmospheric Measurements

The First ISCCP Regional Experiments (FIRE) data, produced by the National Aeronautics and Space Administration, aims to improve cloud and radiation models for climate prediction. The dataset includes measurements from the Arctic Cloud Experiment Utrecht University Tower, focusing on cirrus and marine stratocumulus cloud systems. The data was last updated on 2026-03-13.

GeospatialMultimodalEarth Science Atmospheric Winds Atmosphere SurfaceAtmospheric ScienceEarth Science Atmospheric Radiation Atmosphere LonEarth Science Atmospheric Radiation Atmosphere ShoEarth Science Atmospheric Water Vapor Atmosphere WCloud RadiationClimate ModelingEarth Science Atmospheric Temperature Atmosphere S+1

0 views

Education

MolmoWeb-SyntheticSkills: Web Navigation Agent Trajectories with Screenshots and Actions

MolmoWeb-SyntheticSkills is a dataset of synthetic web-navigation skills created by Allen Institute for AI (allenai). Each example pairs an instruction with a sequence of webpage screenshots and the corresponding low-level agent actions like clicks, typing, and scrolling. The dataset was last updated on March 24, 2026.

MultimodalParquetSize Categories1 Kn10 KLibrarypolarsLibrarydaskModalitytextLibrarymlcroissantModalityimageLibrarydatasetsRegionusWeb NavigationSynthetic DataSyntheticAgent TrajectoriesHuman Computer Interaction+1

0 views

Education

Northern Ireland Post-Primary School Examination Performance 2023-24

The 2023-24 academic year data on examination performance for Year 12 and Year 14 pupils in Northern Ireland. These data are gathered as part of the annual Summary of Annual Examination Results (SAER) exercise, which runs from May to December each year. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license.

TabularA LevelSecondary EducationExamination ResultsNorthern IrelandEducationExaminationGcseSaer+1

0 views

Education

Child Development Centers in the District of Columbia

A list of child development centers provided by the District of Columbia's Office of the State Superintendent of Education. The dataset was last updated on March 25, 2026. It is available in multiple geospatial and tabular formats, including KML, GeoJSON, and CSV.

TabularGeospatialZIPCSVPublic ServiceYouthChildrenWashington DcChild DevelopmentChildEducationHealthDistrict Of ColumbiaDoh+1

0 views

Education

Global K-12 STEM, Robotics, AI and Engineering Education Data

Global K-12 STEM, Robotics, AI & Engineering Education Dataset (Grades 1–12) is aggregated from Kaggle. Its specific size, source, and update frequency are not detailed in the available metadata. The dataset likely contains information on educational programs, resources, or outcomes related to STEM fields.

TabularK 12RoboticsStem EducationEngineering EducationAi Education+1

0 views

Education

PolyglotTeachers SFT Synth: Multilingual Supervised Fine-Tuning Examples

Synthetic supervised fine-tuning examples were generated by teacher models evaluated in the Polyglot Teachers paper. The dataset contains examples across six languages: Arabic, Czech, German, Indonesian, Japanese, Spanish, and Tagalog. It was created by ljvmiranda921 and last updated on April 5, 2026.

TextMultilingualLanguage ModelsSynthetic DataSupervised Fine TuningSynthetic+1

0 views

PreviousPage 289 of 664Next