Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,391 datasets
MediX-R1 is an open-ended medical reinforcement learning dataset created by Mohamed Bin Zayed University of Artificial Intelligence (MBZUAI). The dataset is associated with three model variants: MediX-R1-2B, MediX-R1-8B, and MediX-R1-30B. The dataset page was last updated on February 27, 2026.
A WMS service providing geospatial data for the 'Bildungscampus' urban development plan in Heilbronn, transformed according to INSPIRE standards. The dataset is based on an XPlanung dataset in version 5.4 and is provided by the Bundesamt für Kartographie und Geodäsie. It was last updated on March 9, 2026.
An XPlanung 5.4 WMS service providing the geospatial development plan 'Parkhaus Bildungscampus' for the city of Heilbronn. The plan, identified as '02A/31', details zoning and construction specifications for a car park serving an educational campus. The service is hosted by the Bundesamt für Kartographie und Geodäsie and was last updated on March 9, 2026.
A WMS service provides the "Bildungscampus" development plan for the city of Heilbronn. The Bundesamt für Kartographie und Geodäsie is listed as the organization. The service was last updated on 2026-03-09.
A development plan for the 'Parkhaus Bildungscampus' (Educational Campus Car Park) in Heilbronn, Germany, designated as plan 02A/31. The data is provided as a WFS (Web Feature Service) service conforming to the XPlanung 5.4 standard. The dataset is published by the Bundesamt für Kartographie und Geodäsie and was last updated on 2026-03-09.
This dataset comprises 1.84 million item responses aggregated from 40 empirical psychometric datasets. It was created by Josh Gilbert for a meta-analysis investigating conditional dependencies between response time and item discrimination. The analysis found a pooled coefficient of -0.27% for the relationship between residual response time and item discrimination.
Survey data on Chinese adolescents supporting structural equation modeling (SEM) analysis of school satisfaction pathways. The dataset, authored by Yukai Wei, is a 3.0 MB XLS file last updated in March 2026. Row and column counts are not specified.
Clothing-ADC is a dataset constructed using Automatic Dataset Construction (ADC) methods, as described in a paper by authors from UC Santa Cruz, HKUST(GZ), and other institutions. The dataset is hosted on Hugging Face and was last updated on March 23, 2026. Platform tags indicate it is a web-crawled collection focused on fashion and clothing images, likely containing millions of samples.
DriverGaze360 contains approximately 1 million gaze-labeled frames captured from 19 human drivers to support omnidirectional driver attention modeling. Developed by dfki-av and released around 2025, the data provides a 360-degree field of view for analyzing visual attention in driving scenarios. It includes object-level guidance to assist in mapping gaze to specific environmental elements.
A collection of standardized tasks for assessing mechanistic reasoning in AI agents, created by vida-nyu. The dataset provides experimental context, molecular signatures, and prompts to test an agent's ability to reconstruct explanations from peer-reviewed biological studies. It was last updated on February 27, 2026.
Replication data from a paired cluster-randomized trial evaluating the 'Pathways to Choice' intervention in 18 communities in northern Nigeria. The study found that marriage rates among adolescent girls were 14% in the treatment group, compared with 79% in the control group.
Supplementary material from a 2026 study on the association between fusion visual function deficits and myopia in school-aged children. The dataset, 464.5 KB in size, supports analysis of dynamic visual screening and binocular vision disorders.
1,800 rollout records profiling the frontier difficulty of 900 ShoppingBench tasks using the GPT-OSS-120B model. Created by Jarrodbarnes via the Dynamical environment factory in February 2026, it includes 100 teacher-guided mutations designed to maximize learning signals for reinforcement learning. The data identifies specific 'hillclimbable' tasks where model improvement is most likely.
Oceanographic station data provides a uniform array for studying the physical and chemical properties of the Southern Ocean water column. The dataset was compiled by the Lamont-Doherty Geophysical Observatory of Columbia University and the NOAA National Centers for Environmental Information. Data collection spans from 1947 to 1980, with the atlas format including grid point data files and objective contouring graphics.
Country-level data on selected national public response measures to COVID-19, as presented in weekly overview reports. The dataset includes measures such as gathering cancellations, closures of public spaces and schools, stay-at-home orders, and mask mandates. It was compiled by Marica Teresa Rocca of the University of Pavia, based on raw data from the European Centre for Disease Prevention and Control (ECDC).
A survey of 431 students aged 15-25 from four private universities in Southwest Nigeria. The data was collected using an adapted WHO drug use questionnaire and analyzed by Olujide Adekeye. Results show prevalence rates for substances such as cigarettes (81%) and alcohol (72%), and identify age as a significant predictor of use.
Survey data on musculoskeletal disorders among secondary school teachers in Baglung Municipality, authored by Shishir Paudel and last updated in March 2026. The dataset is provided as a 163.2 KB XLSX file under a CC BY 4.0 license.
Data on selected national public response measures to COVID-19, as presented in weekly country overview reports. The dataset includes measures such as gathering cancellations, closures of public spaces and schools, stay-at-home orders, and mask mandates. It is based on raw data from the European Centre for Disease Prevention and Control (ECDC), harmonized by the University of Pavia.
Replication files support a study conditionally accepted at PNAS as of July 2026. The data was authored by Max Bradley and uploaded to Dataverse in April 2026. It facilitates verification and extension of the research linking educational policies to climate coalition strength.
PL-300 exam preparation materials published on Kaggle. The dataset likely contains questions and answers intended to simulate the Microsoft Power BI Data Analyst certification test. Its content and structure require verification after download.