Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,381 datasets
Prevalence data for obesity among U.S. children and adolescents aged 2-19 years, spanning from 1963-1965 through 2007-2008. The dataset is based on measured height and weight data from the National Health and Nutrition Examination Survey (NHANES) and uses BMI-for-age growth charts for classification. It was authored by Cynthia L. Ogden and tracks trends, including an increase in obesity from 5.0% to 18.1% for adolescents aged 12-19 between 1976-1980 and 2007-2008.
Global Landsat Analysis Ready Data (ARD) provides a spatially and temporally consistent 16-day time series of normalized surface reflectance from 1997 to the present, operationally updated every 16 days. The dataset is created by the Global Land Analysis and Discovery Lab (GLAD) at the University of Maryland for land cover mapping and change detection. Only data from 2020 onward is available on AWS, with older data accessible via the UMD API.
Kartmaan's French Dictionary is derived from the French Wiktionary, containing nearly 900,000 distinct word forms. It provides structured definitions, usage examples, and linguistic metadata, formatted for both SQLite and Parquet applications.
California's authoritative geographic data source for K-12 public school locations during the 2024-25 academic year. The dataset maps schools as point locations with coordinates and is enriched with demographic and performance variables from the California Department of Education. It includes schools open in October 2024, aligned with official Fall Census Day enrollment counts.
DRAKE is a multi-modal federated continual learning benchmark featuring 40 distinct tasks and between 100,000 and 1,000,000 records. Developed by SNUMPR for ICLR 2026, it evaluates agent knowledge through vision-language question answering under realistic distribution shifts over time.
Multimodal learner interaction data, including VR behavior, engagement, and academic performance metrics. The dataset was sourced from Kaggle, but the author, organization, and last update date are unknown. The specific number of rows, file formats, and data size are also unspecified.
517 multivariate instances of forest fire data from northeast Portugal, donated in 2008. The dataset contains 13 real-valued attributes for predicting burned area as a regression task. It was created by Paulo Cortez and Anรญbal Morais from the University of Minho for a data mining study published in 2007.
Capital Improvement Projects (CIP) for new school projects in New York City that are scheduled to complete design within the next six months and become available for bid. The dataset is published by the City of New York on the Data.gov platform and was last updated on March 15, 2026. It lists upcoming public procurement opportunities for education infrastructure construction.
The Crello dataset is a collection of raster graphic designs originally compiled for studying vector graphic documents. It contains document metadata such as canvas size and pre-rendered elements like images or text boxes, sourced from crello.com (now create.vista.com) and converted to a low-resolution format for machine learning.
The dataset 'mc_dropout_outputs' is published on Kaggle. Its title suggests it contains outputs from a machine learning model using Monte Carlo dropout, a technique for estimating predictive uncertainty. The specific content, size, and origin require verification after download.
Kaggle hosts a dataset for a banana fruit classification task. The dataset likely contains images of bananas intended for training and evaluating machine learning models. Its specific size, origin, and collection date are not detailed in the provided metadata.
A dataset for classifying banana fruit, likely created for an educational machine learning task. It was published on the Kaggle platform. The specific data volume, features, and creation date are not detailed in the available metadata.
Motorbike Data of University Students is a dataset published on Kaggle. The dataset likely contains information about motorbike ownership, usage patterns, or related behaviors among a university student population. Metadata is minimal; actual content requires verification after download.
First ISCCP Regional Experiment - Arctic Cloud Experiment data was collected to improve cloud and radiation parameterizations in General Circulation Models. The dataset originates from the Utrecht University Tower and was managed by the LARC_ASDC organization. It was last updated in May 1998.
Data from the First ISCCP Regional Experiment's Arctic Cloud Experiment collected via a tethered balloon operated by Utrecht University. The experiment aimed to improve cloud parameterizations in climate models by linking satellite data with high-resolution cloud observations. The dataset was last updated by NASA's LARC_ASDC in May 1998.
Aircraft-based measurements of Arctic clouds collected by the University of Washington's CV580 aircraft during the FIRE Arctic Cloud Experiment (ACE) and Surface Heat Budget of the Arctic Ocean (SHEBA) field campaign. The data set was designed to improve understanding of cloud physical processes and their representation in general circulation models. It was published by the NASA Langley Research Center Atmospheric Science Data Center in 1998.
New York State DMV records for regulated driver training businesses. The dataset includes business names, addresses, phone numbers, and the courses offered. It was last updated on March 14, 2026.
Initial Teacher Training and In-Service Practice in Programming, Robotics, and Machine Learning. The dataset was authored by Esteban Vazquez-Cano and hosted on Harvard Dataverse. It was last updated on April 14, 2026.
A dataset for modeling sparse, noisy signals with uncertainty-aware learning. It was uploaded to Kaggle, but the author, organization, and specific creation details are unknown. The dataset's size, row count, and temporal coverage are unspecified.
Teacher_checkpoints is a dataset published on Kaggle. Its specific content and scale are unknown from the provided metadata. The title suggests it likely contains data related to teacher evaluation, training progress, or educational milestones.