Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,362 datasets
A 9.5 KB Excel file contains content validity indices for a health education module. The data was contributed by author Heba S. M. Mustafaalsaafin and last updated on March 17, 2026. It is licensed under CC-BY-4.0 and hosted on figshare.
United Arab Emirates data on topics and content descriptions for a health education module. The dataset, created by Heba S. M. Mustafaalsaafin, is a 9.5 KB Excel file last updated in March 2026. The module was developed in two phases and validated by six nutrition experts.
An Excel file detailing the study procedures and timing of assessments for a clinical trial on total knee arthroplasty. The dataset, created by Ariena J. Rasker, is licensed under CC-BY-4.0 and was last updated in March 2026. It is a small file of 5.5 KB, suggesting it contains metadata or a study protocol overview.
Educational characteristics, perceived communication competence, and clinical practice context (n=450). The dataset, authored by Jaime Carballedo-Pulido and last updated in March 2026, is a 9.5 KB XLS file available on figshare under a CC-BY-4.0 license.
Over 6000 individual hydrographic station records from the Southern Ocean, assembled from the NODC archive by researchers from Columbia University, Science Applications Inc., and Lamont-Doherty Geological Observatory. The dataset includes interpolated data on a uniform grid at 47 depth levels. It was published in the late 1970s, with the underlying atlas released by Columbia University Press.
Project Token-Exhaustion provides examples of output from the Gemini Search AI. The dataset likely contains text data extracted from PDF documents generated by the AI model. Its exact size, origin, and creation date are unspecified.
Empleados InnovaCorp is a teaching dataset hosted on Kaggle. It is designed to contain realistic data quality issues for hands-on preprocessing practice. The dataset's author, organization, and specific scale are not provided in the available metadata.
A collection of AI-generated images sourced from Civitai, a platform for sharing Stable Diffusion models and content. The dataset includes metadata such as image IDs, model versions, post IDs, URLs, and hashes. It was created by MAPS-research and last updated on March 9, 2026.
A structured dataset derived from the AMC 12 (American Mathematics Competitions), designed for LLM training, evaluation, and reinforcement learning on mathematical reasoning tasks. It contains all AMC 12 problems from 2000 to 2025, making it one of the most complete AMC 12 datasets available for research. The dataset was created by edev2000 and last updated on March 17.
Real-time pathway optimization data showing how middle school students thrive. The dataset likely contains logs of student interactions and learning pathways. Its source and temporal coverage are unknown.
Indonesian Automated Short Answer Grading (ASAG) data from a Vocational High School context. The dataset likely contains student answers and grading information for a Basic Computer Network course. Its specific scale, authorship, and recency are unknown.
Kaggle hosts this dataset focused on predictive modeling. The specific data content, size, and features are not detailed in the available metadata. Metadata is minimal; actual content requires verification after download.
A dataset for predicting student dropout risk using machine learning techniques. It was published on Kaggle, though the specific source institution and time range are unknown. The dataset likely contains features for modeling student retention.
Example scenes generated by the SceneSmith hierarchical agentic framework for constructing simulation-ready indoor environments from natural language prompts. The dataset contains all scenes from the method and its ablations used in the paper evaluations. Each scene is a complete environment with 3D assets, collision meshes, floor plans, and VLM-estimated physical properties.
NOAA's Southeast Area Monitoring and Assessment Program (SEAMAP) collected temperature, salinity, and dissolved oxygen measurements via CTD and bottle casts from the NOAA Ship OREGON II in the Gulf of Mexico. The dataset represents a State/Federal/university collaborative effort for fishery-independent data collection and spans the period from 2001 to 2012. It is managed by the National Oceanic and Atmospheric Administration, Department of Commerce.
Shanghai Jiao Tong University's annual ranking of global universities from 2003 to 2024. The dataset likely contains scores and ranks based on academic and research performance indicators. Its long time series allows for analysis of institutional performance trends over more than two decades.
Kaggle hosts this dataset of speech features and pronunciation assessment records. The data is intended for defect classification in oral English pronunciation. The author, organization, and specific scale of the collection are not provided.
Kaggle dataset titled 'dropoutlens_ai' likely concerning student dropout prediction. The dataset's specific content, size, and origin are unverified from the provided metadata. Its columns and structure must be inspected after download.
An AusGeo News article outlines the geological and petroleum prospectivity assessment for the Capel and Faust basins. The assessment was conducted by Geoscience Australia's Remote Eastern Frontiers project from 2006 to 2010. It details the regional setting, data acquisition methods, assessment methodology, and study findings.
The 2015 annual report on educational performance for London, published by the Greater London Authority. It is the third such report in a series, with underlying data available via a provided web link. The platform's metadata records the last update as 2026-03-25.