Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,421 datasets
Data collected with the Diversity Teaching Beliefs Scale (DTBS), a multidimensional instrument assessing teacher beliefs. It measures three distinct dimensions: acknowledging cultural differences (ACD), emphasizing cultural communalities (ECC), and socialization of the national culture (SNC). The data was used in three studies to validate the scale's psychometric properties and analyze belief co-occurrence patterns.
Child care licensing inspection data details sections evaluated, standards assessed, and violations found per activity. The dataset links to operations and activity records via Operation ID and Activity ID. It covers inspections, investigations, and assessments conducted by the City of Austin.
The Paderborn University Car Bearing Dataset is a collection of measurements for condition monitoring and fault diagnosis. It has been converted to a .csv format for accessibility. The dataset originates from research at Paderborn University.
This dataset classifies 229 Texas counties into five strategic tiers based on composite opportunity scores for university recruitment. It identifies 6 'Prime Target' counties with strong academic readiness, large student populations, and low current university presence. The analysis weights academic readiness (40%), market size (30%), current market share (20%), and test participation (10%).
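The weighting described above can be sketched as a simple weighted sum. This is only an illustration, not the dataset author's actual method: the field names, the 0-100 component scale, and the assumption that the market-share component is already inverted (so that low current university presence scores high) are all hypothetical.

```python
from typing import Mapping

# Weights quoted in the dataset description (40/30/20/10).
# The key names are hypothetical labels chosen for this sketch.
WEIGHTS = {
    "academic_readiness": 0.40,
    "market_size": 0.30,
    "current_market_share": 0.20,
    "test_participation": 0.10,
}


def opportunity_score(components: Mapping[str, float]) -> float:
    """Return the weighted sum of component scores.

    Assumes every component is already normalized to a 0-100 scale and that
    "current_market_share" is inverted so a higher value means less presence.
    """
    return sum(weight * components[name] for name, weight in WEIGHTS.items())


# Made-up component scores for one county, for illustration only.
example_county = {
    "academic_readiness": 80.0,
    "market_size": 70.0,
    "current_market_share": 90.0,
    "test_participation": 50.0,
}

score = opportunity_score(example_county)
print(f"Composite opportunity score: {score:.1f}")
```

How composite scores map onto the five tiers is not stated in the description, so no tier thresholds are shown here.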
Exam-violation-bilstm is a dataset hosted on Kaggle, likely containing time-series or sequential data for detecting academic misconduct. The dataset's title suggests it is designed for training Bidirectional Long Short-Term Memory (BiLSTM) neural networks. Its author, organization, and specific details like size and license are currently unknown.
Output data from a Life Cycle Assessment model details the environmental impact of compostable nappies across five stages: manufacture, transport, use, collection, and disposal. The dataset includes material inputs, energy consumption, and environmental outputs for analyzing end-of-life scenarios like landfill, incineration, and composting. It was published by the Environmental Information Data Centre.
The 2009-2012 fence diagram of Great Britain's bedrock geology comprises 121 cross-sections with an aggregate length of over 20,000 km. Compiled by 14 expert regional geologists from the British Geological Survey, it integrates subsurface data to depths between 1.5 and 6 km. The dataset provides a consistent 3D structural context for regional studies and education.
British Geological Survey maps depict the spatial extent of principal UK coal resources, overlaying existing workings and potential new technologies. The project covers all onshore coalfields, including Northern Ireland, and includes data for 21 individual regions at a 1:100,000 scale. Work was initiated in April 2002 and completed in October 2003.
South African Department of Basic Education exam papers, likely for high school subjects. The dataset is hosted on Kaggle, but the specific subjects, years, and number of papers are unknown. The original author and last update date are not provided.
Titanic Dataset is a collection of passenger information from the RMS Titanic disaster, commonly used for introductory machine learning tasks. It is published on the Kaggle platform. The dataset's specific size, features, and last update date are unknown from the provided metadata.
A classic dataset for machine learning practice, published on Kaggle. It likely contains measurements for classifying iris flower species. The original Iris dataset is a foundational benchmark for classification algorithms.
Mt Unlearning Checkpoints is a dataset uploaded by harishm17 to Hugging Face, last updated on March 29, 2026. The dataset's title and platform tags suggest it contains saved model states related to the machine unlearning research domain. The specific content, scale, and structure require verification after download.
A field operational test of an early prototype Drowsy Driver Warning System was conducted by the National Highway Traffic Safety Administration and the Federal Motor Carrier Safety Administration. The final dataset for analysis consisted of 102 drivers from 3 for-hire trucking fleets using 46 instrumented trucks, containing nearly 12.4 terabytes of truck instrumentation, kinematic data, and video recordings for 2.4 million miles of driving. This dataset is described as the largest ever collected by the U.S. Department of Transportation.
Digitized treatments from a 2008 book chapter by paleontologist Peter Larson analyze variation and sexual dimorphism in Tyrannosaurus rex. The data is sourced from the Plazi repository and the original publication 'Tyrannosaurus rex, the tyrant king'. It likely contains detailed morphological measurements and observations for the species.
A package by Suman Kundu, Yurii S. Aulchenko, and A. Cecile J.W. Janssens provides functions to assess the performance of risk prediction models. It includes measures like c-statistic (AUC), Hosmer-Lemeshow test, net reclassification improvement (NRI), and integrated discrimination improvement (IDI). The package also contains functions for logistic regression analysis and to construct a simulated dataset with genotypes and disease status for evaluating genetic risk models.
A research brief synthesizing findings from over 150 studies on adolescent reproductive health in the United States. It was authored by Jennifer Manlove for Child Trends and focuses on factors influencing behaviors like sexual initiation, contraceptive use, and pregnancy. The brief includes a table of successful programs and approaches.
Teacher train files containing query and passage IDs from the MSMARCO-Passage collection, created for the paper 'Improving Efficient Neural Ranking Models with Cross-Architecture Knowledge Distillation'. Sebastian Hofstätter and colleagues at TU Wien developed this resource to support research into distilling ranking knowledge between different neural architectures. The associated documentation and code are hosted on GitHub.
31 U.S. states and 1 city collect data on maternal attitudes, behaviors, and experiences before, during, and after pregnancy. The Pregnancy Risk Assessment Monitoring System (PRAMS) is a joint project between the CDC and state health departments, surveying between 1,300 and 3,400 women per state annually since 1987. Surveillance reports are available for years including 1995-2000, 2002, and 2007.
The prevalence package provides statistical tools for prevalence assessment studies. It includes Frequentist and Bayesian methods, requiring the JAGS (Just Another Gibbs Sampler) software for the Bayesian truePrev functions. The package was authored by Brecht Devleesschauwer.
The City of Austin Planning Department provides a PDF report on the implementation status of adopted small area plans, such as neighborhood and station area plans. The dataset tracks the status of recommendations by planning area name, but the statistics are no longer being updated.