DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.


Education Datasets | DataSalon

🎓

Education

Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics

13,381 datasets

U.S. Child and Adolescent Obesity Prevalence Trends, 1963-2008

Prevalence data for obesity among U.S. children and adolescents aged 2-19 years, spanning from 1963-1965 through 2007-2008. The dataset is based on measured height and weight data from the National Health and Nutrition Examination Survey (NHANES) and uses BMI-for-age growth charts for classification. It was authored by Cynthia L. Ogden and tracks trends, including an increase in obesity from 5.0% to 18.1% for adolescents aged 12-19 between 1976-1980 and 2007-2008.

Tabular, Time Series, Medicine, Environmental Health, Child Health, Psychology, Obesity, Healthcare, Endocrinology, Geography, Demography, Sociology, Public Health, +1 more

GLAD Landsat ARD: Global Landsat Surface Reflectance Time Series

Global Landsat Analysis Ready Data (ARD) provides a spatially and temporally consistent 16-day time series of normalized surface reflectance from 1997 to the present, operationally updated every 16 days. The dataset is created by the Global Land Analysis and Discovery Lab (GLAD) at the University of Maryland for land cover mapping and change detection. Only data from 2020 onward is available on AWS, with older data accessible via the UMD API.

Time Series, Geospatial, Satellite Imagery, Agriculture, Natural Resource, Earth Observation, Land Cover, COG, +1 more

French Wiktionary Dictionary with 900,000 Word Forms

Kartmaan's French Dictionary is derived from the French Wiktionary and contains nearly 900,000 distinct word forms. It provides structured definitions, usage examples, and linguistic metadata, and is distributed in both SQLite and Parquet formats.

French, SQLite, Parquet, Library: polars, Size Categories: 1M<n<10M, Dictionnaire, Modality: text, Modality: tabular, Library: mlcroissant, License: CC BY-SA 4.0, Dictionary, Library: datasets, Database, Library: pandas, Français, Offline, Wiktionary, Region: US, Natural Language Processing, +1 more

California K-12 Public School Locations for 2024-25 Academic Year

California's authoritative geographic data source for K-12 public school locations during the 2024-25 academic year. The dataset maps schools as point locations with coordinates and is enriched with demographic and performance variables from the California Department of Education. It includes schools open in October 2024, aligned with official Fall Census Day enrollment counts.

Education, Master, California Department of Education, Open Data, School Site, +1 more

DRAKE: Multi-Modal Federated Learning Benchmark with 40 Tasks

DRAKE is a multi-modal federated continual learning benchmark featuring 40 distinct tasks and between 100,000 and 1,000,000 records. Developed by SNUMPR for ICLR 2026, it evaluates agent knowledge through vision-language question answering under realistic distribution shifts over time.

Multimodal, Task Categories: question-answering, Language: en, arXiv:2205.01917, Size Categories: 100K<n<1M, Vision Language, License: CC BY 4.0, Continual Learning, Region: US, arXiv:2306.07890, Agent, arXiv:2306.09344, Federated Learning, arXiv:2410.12705, +1 more

AI–IoT English Learning Behavior with Multimodal Interaction and VR Data

Multimodal learner interaction data, including VR behavior, engagement, and academic performance metrics. The dataset was sourced from Kaggle, but the author, organization, and last update date are unknown. The specific number of rows, file formats, and data size are also unspecified.

Multimodal, English Learning, IoT Behavior, Multimodal Interaction, Academic Performance, VR Education, +1 more

Forest Fire Burn Area Predictions in Northeast Portugal

517 multivariate instances of forest fire data from northeast Portugal, donated in 2008. The dataset contains 13 real-valued attributes for predicting burned area as a regression task. It was created by Paulo Cortez and Aníbal Morais from the University of Minho for a data mining study published in 2007.

Tabular, Wildfire Prediction, Environmental Science, Regression, Meteorological Data, Climate Data, Forest Fires, Synthetic, +1 more

Upcoming New York City School Construction Contracts for Bid

Capital Improvement Projects (CIP) for new school projects in New York City that are scheduled to complete design within the next six months and become available for bid. The dataset is published by the City of New York on the Data.gov platform and was last updated on March 15, 2026. It lists upcoming public procurement opportunities for education infrastructure construction.

Tabular, CSV, XML, JSON, Improvement, Project, School, Authority, SCA, Public Procurement, Capital Projects, Education, Construction Bids, Complete, Education Infrastructure, Construction, Completion, School Construction, Design, Capital, Bid, +1 more

Raster Graphic Design Templates with Document Metadata

The Crello dataset is a collection of raster graphic designs originally compiled for studying vector graphic documents. It contains document metadata such as canvas size and pre-rendered elements like images or text boxes, sourced from crello.com (now create.vista.com) and converted to a low-resolution format for machine learning.

Parquet, Source Datasets: original, Size Categories: 10K<n<100K, Library: polars, arXiv:2509.25134, Library: dask, Language: en, Language Creators: found, Modality: timeseries, Modality: text, Annotations Creators: no-annotation, Library: mlcroissant, Modality: image, Library: datasets, Task Categories: image-segmentation, Region: US, Graphic Design, Multilinguality: monolingual, +1 more

MC Dropout Outputs: Model Predictions with Uncertainty Estimates

The dataset 'mc_dropout_outputs' is published on Kaggle. Its title suggests it contains outputs from a machine learning model using Monte Carlo dropout, a technique for estimating predictive uncertainty. The specific content, size, and origin require verification after download.

Tabular, Machine Learning, Model Outputs, Uncertainty Quantification, +1 more

Banana Fruit Images for Classification Tasks

Kaggle hosts a dataset for a banana fruit classification task. The dataset likely contains images of bananas intended for training and evaluating machine learning models. Its specific size, origin, and collection date are not detailed in the provided metadata.

Image, Machine Learning, Computer Vision, Agriculture, Image Classification, Banana, Fruit Classification, Fruit, +1 more

Banana Fruit Classification Dataset

A dataset for classifying banana fruit, likely created for an educational machine learning task. It was published on the Kaggle platform. The specific data volume, features, and creation date are not detailed in the available metadata.

Tabular, Machine Learning, Education, Banana, Fruit Classification, +1 more

Motorbike Usage Data from University Students

Motorbike Data of University Students is a dataset published on Kaggle. The dataset likely contains information about motorbike ownership, usage patterns, or related behaviors among a university student population. Metadata is minimal; actual content requires verification after download.

Tabular, Survey Data, Transportation, University Students, Motorbike, +1 more

Arctic Cloud Experiment Data for Climate Model Validation

First ISCCP Regional Experiment - Arctic Cloud Experiment data was collected to improve cloud and radiation parameterizations in General Circulation Models. The dataset originates from the Utrecht University Tower and was managed by the LARC_ASDC organization. It was last updated in May 1998.

Time Series, Geospatial, Cloud Physics, Satellite Observations, Climate Modeling, +1 more

Arctic Cloud Observations from Tethered Balloon Experiment

Data from the First ISCCP Regional Experiment's Arctic Cloud Experiment collected via a tethered balloon operated by Utrecht University. The experiment aimed to improve cloud parameterizations in climate models by linking satellite data with high-resolution cloud observations. The dataset was last updated by NASA's LARC_ASDC in May 1998.

Time Series, Geospatial, Atmospheric Science, Cloud Physics, Satellite Validation, Arctic Climate, +1 more

Arctic Cloud Measurements from the FIRE ACE Aircraft Campaign

Aircraft-based measurements of Arctic clouds collected by the University of Washington's CV580 aircraft during the FIRE Arctic Cloud Experiment (ACE) and Surface Heat Budget of the Arctic Ocean (SHEBA) field campaign. The data set was designed to improve understanding of cloud physical processes and their representation in general circulation models. It was published by the NASA Langley Research Center Atmospheric Science Data Center in 1998.

Tabular, Time Series, Atmospheric Science, Cloud Physics, Aircraft Measurements, Climate Modeling, +1 more

New York DMV Driving School Registry with Address and Course Information

New York State DMV records for regulated driver training businesses. The dataset includes business names, addresses, phone numbers, and the courses offered. It was last updated on March 14, 2026.

Tabular, CSV, XML, JSON, Pre-Licensing Course, New York, Business Registry, Driving Schools, Driving School, Driver Training, Driving Lessons, +1 more

Prombot_Teachers

A dataset on initial teacher training and in-service practice in programming, robotics, and machine learning. It was authored by Esteban Vazquez-Cano and is hosted on Harvard Dataverse. It was last updated on April 14, 2026.

Tabular, Programming Education, Teacher Training, Machine Learning Education, Robotics Education, +1 more

Sparse Noisy Signals for Uncertainty-Aware Learning

A dataset for modeling sparse, noisy signals with uncertainty-aware learning. It was uploaded to Kaggle, but the author, organization, and specific creation details are unknown. The dataset's size, row count, and temporal coverage are unspecified.

Time Series, Machine Learning, Signal Processing, +1 more

Teacher Checkpoints: Performance or Training Data

Teacher_checkpoints is a dataset published on Kaggle. Its specific content and scale are unknown from the provided metadata. The title suggests it likely contains data related to teacher evaluation, training progress, or educational milestones.

Tabular, Teacher Performance, Education, Checkpoints, +1 more
