Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,381 datasets
SpatialVID provides between 1 million and 10 million video records paired with spatial annotations, developed by researchers at Nanjing University and the Chinese Academy of Science for CVPR 2026. The data supports multi-modal generative tasks by linking video sequences with 3D spatial metadata and English text descriptions.
A dataset of Indian government exam results, likely for the year 2026, collected via API and processed using Python. The data appears to be structured for analysis, though specific column details and scale are not provided. The original source is unknown.
Master List of all Approved and Denied Education courses maintained by Vermont's licensing authority. The dataset catalogs courses across various trades and professions, including details on providers, instructors, and course hours. It was last updated by data.vermont.gov in March 2026.
Kaneohe Bay, Oahu, Hawaii hosts a 2003 study examining the growth rate of the sea urchin Tripneustes gratilla for potential algae control on shallow reefs. NOAA's National Centers for Environmental Information (NCEI) provides this dataset, which includes measurements from both natural habitats and experimental introduction sites. The dataset's cross-platform presence on NASA Earthdata and Data.gov indicates its established use in environmental research.
A conceptual rainfall-runoff model developed at the Vienna University of Technology, following the structure of the HBV model. The model runs on a daily or shorter time step and consists of routines for snow, soil moisture, and flow routing. It was created by Alberto Viglione and is documented in a 2007 paper by Parajka et al. in Hydrological Processes.
COHERENT collaboration data release from the first detection of coherent elastic neutrino-nucleus scattering (CEvNS) on argon. The data corresponds to 'Analysis A' published in arXiv:2003.10630 and is intended to enable further studies of CEvNS. The release includes example code and is also available from the COHERENT website at Oak Ridge National Laboratory.
A study of 76 learners of Chinese as a foreign language in the United States examined their foreign language reading anxiety levels. Data was collected via a reading anxiety survey, a background information survey, and face-to-face interviews. The study was authored by Jing Zhou and sourced from the paperswithcode platform.
NOAA_NCEI provides a dataset examining the geochemical behavior of permeable reef flat sediments on Checker Reef, Oahu, Hawaii. It captures spatial and temporal changes in pore water dissolved oxygen, nitrate, nitrite, ammonium, and nitrous oxide. The data was collected between October 1996 and July 1997 to study hydraulic control of geochemistry following significant wave events.
Data from the COHERENT Collaboration associated with the first observation of coherent elastic neutrino-nucleus scattering (CEvNS), published in Science in 2017. The dataset is intended to enable researchers to extend the study of CEvNS, with future collaboration results planned for similar releases. It was authored by D. Akimov of the Institute for Theoretical and Experimental Physics.
semTools provides miscellaneous utilities extending the 'lavaan' package for structural equation modeling. The package, authored by Terrence D. Jorgensen, includes methods for estimating latent interactions and conducting analytical power analyses. It also offers tools for calculating scale reliability based on factor-model parameters.
A dataset of vanilla planifolia fruit images intended for computer vision and deep learning tasks. The dataset is hosted on Kaggle, but its size, specific contents, and creation details are unspecified.
Kaggle dataset titled 'Dropout_shcool' likely contains information on student attrition. The dataset's specific content, size, and origin are not detailed in the provided metadata. Metadata is minimal; actual content requires verification after download.
Kannada Handwritten Glyph Dataset forDeep Learning is a dataset hosted on Kaggle. The dataset likely contains images of handwritten characters from the Kannada script, intended for training machine learning models. The specific number of samples, collection methodology, and authorship details are not provided in the available metadata.
Boston ML Learning is a dataset hosted on Kaggle. Its title suggests it is related to machine learning education, likely containing data for instructional or practice purposes. The dataset's specific content, size, and origin are not detailed in the available metadata.
5,456 Chinese vocabulary entries mapped to the HSK 3.0 standard, published by Tiagodfs and updated in March 2026. It provides a structured list of terms across HSK levels 1 through 6 in a UTF-8 CSV format.
A dataset titled 'Dropout_school' published on Kaggle. The dataset's content and scale are not described in the provided metadata. Its author, organization, and last update date are unknown.
An arboretum forest dataset managed by the ACT Government, authored by Greg Tankard and last updated in March 2026. The dataset documents a botanical garden collection dedicated to tree conservation, scientific research, and education.
An official publication from the Communications Security Establishment Canada explaining how machines use data, algorithms, and learning models to perform tasks requiring human intelligence. The document is provided in HTML format and was last updated in March 2026.
Prevalence data for obesity among U.S. children and adolescents aged 2-19 years, spanning from 1963-1965 through 2007-2008. The dataset is based on measured height and weight data from the National Health and Nutrition Examination Survey (NHANES) and uses BMI-for-age growth charts for classification. It was authored by Cynthia L. Ogden and tracks trends, including an increase in obesity from 5.0% to 18.1% for adolescents aged 12-19 between 1976-1980 and 2007-2008.
Global Landsat Analysis Ready Data (ARD) provides a spatially and temporally consistent 16-day time series of normalized surface reflectance from 1997 to the present, operationally updated every 16 days. The dataset is created by the Global Land Analysis and Discovery Lab (GLAD) at the University of Maryland for land cover mapping and change detection. Only data from 2020 onward is available on AWS, with older data accessible via the UMD API.