Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,418 datasets
World Health Organization data identifies the primary public sector organizations providing training and education for dementia caregivers. The dataset focuses on the majority provider within this sector. The data is compiled by the WHO's Global Health Observatory.
80 documented cloud misconfigurations, 50 best practices, and 100 educational Q&A pairs form this bilingual English/French dataset. The dataset, created by AYI-NEDJIMI, was last updated on February 13, 2026. It focuses on common security errors and guidance for major cloud platforms.
Kaggle hosts this dataset concerning student performance. The dataset likely contains variables related to academic outcomes and influencing factors. Its specific size, origin, and temporal coverage are not detailed in the provided metadata.
A dataset titled 'HW3.MachineLearning_lo' published on Kaggle. The title suggests it is likely associated with a third homework assignment in a machine learning course. Its specific content, size, and origin are unknown from the provided metadata.
Bolivia-education-corpus-v10 is a dataset containing Bolivian education data focused on indigenous languages. The data is intended for use in developing offline small language models. The dataset's author, organization, and specific collection details are not provided.
AI Tutor Student Performance 2026 Synthetic is a synthetic dataset created for exploratory data analysis and student risk prediction. The data simulates AI tutor usage and student outcomes for the year 2026. Its origin, exact size, and specific features are not detailed in the provided metadata.
Pakistan's education system is represented by 10,000 school records spanning seven provinces. The dataset includes 34 features covering enrollment and infrastructure metrics. It was sourced from Kaggle, but the author, organization, and specific collection method are unknown.
4.79 million Slovenian web documents enriched with metadata for educational value, domain classification, and web registers. The dataset, created by zID4si, includes a filtered subset optimized for language model pre-training. It was last updated on February 13, 2026.
Every assessed property value in Cook County from 2010 to 2020 includes three distinct assessment stages per year: CCAO Mailed, CCAO Certified, and Board of Review Certified final values. The data, archived from the Cook County Assessor's office, allows direct comparison of valuations across different government agencies.
Transfer Learning Updated Outputs is a dataset hosted on the Kaggle platform. The title suggests it contains results or predictions from models that have undergone transfer learning. Specific details regarding its size, origin, and creation date are not provided in the available metadata.
Kaggle hosts a dataset related to the textbook 'Machine Learning' by Tom Mitchell. The dataset likely contains examples, exercises, or supplementary materials for educational use. The specific contents, size, and origin of the data require verification after download.
AI-MO released this collection of approximately 900,000 competition-level math problems in early 2026 for model post-training. It features Chain of Thought (CoT) solutions derived from Chinese high school exercises and international mathematics olympiads.
An R script for Structural Equation Modeling analysis of the MiCREATE UK survey dataset, authored by Charles Tendai. The script is hosted by Harvard Dataverse and was last updated in March 2026. Specific data dimensions like row and column counts are not provided in the input.
District of Columbia data from datagov, last updated on 2026-03-25. The dataset relates to the District's educational vision for student success and school accountability. It is published under a CC-BY-4.0 license.
District of Columbia data details regulations, outreach, education, and incentive services aimed at promoting sustainable urban behavior. The dataset covers initiatives targeting residents, businesses, and institutions within the DC area. Specific row counts, column details, and file formats beyond HTML are unavailable.
Student Performance Prediction (1000 inputs) is a dataset hosted on Kaggle. The title suggests it contains data for predicting student academic outcomes. The dataset's specific source, collection method, and temporal coverage are unknown.
Processed NIfTI images from the EMIDEC dataset for myocardial infarction assessment. The dataset includes Delayed-Enhancement MRI scans with segmentation labels for the left ventricle, myocardium, infarction, and no-reflow zones. It was uploaded by user viennh2012 to Hugging Face and last updated on 2026-02-16.
GEAR scrapes forum posts from the Greek educational community website ischool.gr, where students express anxiety about the Panhellenic exams. The dataset pairs these student posts with responses from three large language models, including Krikri and Aya, to evaluate LLM empathy and reasoning. It was created by PennyK98 and last updated on February 24, 2026.
House Bill 1599 established statewide high school graduation pathways, requiring annual reporting by the Office of Superintendent of Public Instruction. This dataset tracks pathway completion rates for all four-year graduation cohorts from the Class of 2020 onward, as mandated by the 2019 legislation.
This longitudinal dataset tracks psychological outcomes for 22 Afghan youth survivors of the Kaaj Education Center attack following Memory-Focused Therapy (MFT). Collected by Sayed Jafar Ahmadi and published in 2026, the data includes standardized mental health assessments at baseline, post-intervention, and a three-month follow-up.