Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,372 datasets
Chicago Public Schools implemented a probation policy in 1996 for elementary schools where fewer than 15% of students scored at grade-level norms on the Iowa Test of Basic Skills in reading. The dataset likely contains results from a two-year study analyzing the design and implementation of this policy, including the external support provided to schools. The study was authored by Kara Finnigan and focuses on the policy's consequences and assistance mechanisms.
A major longitudinal effort by the United States Department of Education to track the 1988 eighth-grade cohort through high school and beyond. The study collects policy-relevant data on educational processes, dropout predictors, and equal opportunity from four components: Parent, School Administrator, Student, and Teacher. This first stage provides foundational trend data on critical transitions from elementary school.
Data from the June to July 1971 SUBARCTIC-A cruise of the R/V YAQUINA detail the vertical distribution of zooplankton at Ocean Station 'P' in the Subarctic Pacific. The analog report contains tabular estimates of zooplankton displacement volumes and numerical densities for many species or categories at different depths. This archival dataset provides a snapshot of pelagic community structure from a historically significant oceanographic station.
A machine learning model checkpoint named 'tom_checkpoint_dropout_0.3' published on Kaggle. The dataset likely contains saved model parameters and architecture details. Its specific contents and creation date are unknown.
A dataset titled TOM_BEST_NO_DROPOUT, published on Kaggle. Given the similarly named Kaggle checkpoint 'tom_checkpoint_dropout_0.3', the name more plausibly refers to a model trained without dropout regularization than to student dropout, though this cannot be confirmed from the metadata. The dataset's content, scale, and authorship are unknown from the provided metadata.
Lyrics For Learning is a text dataset published on Kaggle. The title suggests it contains song lyrics intended for educational or analytical purposes. Metadata is minimal; actual content, size, and structure require verification after download.
Kaggle hosts a dataset from a machine learning competition. The specific topic and scale are not detailed in the provided metadata. The data is likely structured for predictive modeling tasks typical of the platform.
Student Performance Prediction Dataset is hosted on Kaggle and likely contains information for predicting student academic outcomes. Specific details such as column definitions, size, and origin are not provided in the metadata.
A dataset concerning student exam performance, sourced from the Kaggle platform. The specific number of records, features, and temporal coverage are not provided in the available metadata. The dataset's content and structure require verification after download.
Greater London Authority provides a dataset of air quality exposure for educational institutions, broken down by parliamentary constituency. The data is based on the 2013 London Atmospheric Emissions Inventory and includes all educational establishments except private nurseries. The legal limit for NO2 is an annual mean concentration of 40 µg/m³.
1,062 synthetic multi-turn conversations between a user and an AI assistant focus on practical agentic tasks like train ticket booking, dynamic form filling, and payment processing. Created by DataCreator AI, the dataset includes diverse scenarios such as successful execution, context retrieval, tool integration, and failure recovery.
A study analyzing factors influencing student learning across countries, using data from the Programme for International Student Assessment (PISA). The dataset, authored by Gregory J. Marchant, examines relationships between student demographics, academic strategies, metacognition, and achievement. The primary data source is PISA 2009, focusing on cognitive and control strategies used by students.
Runmin Wei authored a software package for evaluating multi-class classification models. The package computes areas under ROC and PR curves using micro-averaging and macro-averaging methods. Its methodology references academic papers by Van Asch (2013) and Pedregosa et al. (2011).
A formal toolkit for rating instructional quality based primarily on classroom observation and student assignments. The toolkit was developed for reading comprehension and mathematics at the elementary school level. A large pilot study was conducted in Spring 2003 in two moderately large urban school districts.
Lieutenant Colonel John A. Nagl's book compares counterinsurgency doctrine development in the Malayan Emergency (1948-1960) and the Vietnam War (1950-1975). The analysis uses archival sources and participant interviews to argue that organizational culture determines an army's ability to adapt. A new preface reflects on the author's combat experience in Iraq.
SFT-Dataset is a collection for supervised fine-tuning of base language models like Qwen/Qwen3-4B-Base. The dataset is tagged for text generation, reasoning, math, science, and code tasks. It is authored by 96kevinli29 and was last updated in March 2026.
Teacher data published on Kaggle. The dataset's specific content and scope are not detailed in the provided metadata. Columns, sample data, and other specifics are unknown.
DigitalCorpora provides disk images, memory dumps, and network packet captures for digital forensics research. The data is synthetic, created by students and faculty acting in persona, allowing use without prior authorization or IRB approval. This collection is hosted on AWS S3 and accessible via the digitalcorpora.org website.
Figshare hosts a 5.5 KB dataset analyzing three machine learning regression models. The data compares model performance on confinement loss and wavelength sensitivity metrics for enhanced plasmonic sensors. Author Sonia Akter contributed this analysis under a CC BY 4.0 license.
Student Attention and Teaching Behavior Data published on Kaggle. The dataset likely contains observations linking student engagement metrics with instructor actions. Its specific size, collection method, and temporal coverage are not detailed in the available metadata.