Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,338 datasets
England's VITAE project investigated factors influencing teacher effectiveness over a four-year period from 2001 to 2005. The research, commissioned by the DfES and led by Christopher Day at the University of Nottingham, involved primary and secondary teachers of varying age and experience across a range of schools. It examined the interplay between teachers' professional and personal lives and their impact on pupil attainment during a period of significant educational policy change.
An R package providing utilities for computing model quality measures not directly available in R's base packages. It includes functions for metrics like R-squared, intraclass correlation coefficient, and root mean squared error, as well as checks for overdispersion and zero-inflation. The package, authored by Daniel Lüdecke, applies to a variety of regression models including generalized linear, mixed effects, and Bayesian models.
Field data collected by the Department of the Interior to examine diamond-backed terrapin (Malaclemys terrapin) estimation and commercial blue crab (Callinectes sapidus) trapping. The dataset includes metadata, CSV files, and spatial information provided as raster and vector data in a geodatabase. It was last updated on March 4, 2026.
PaTaRM-data is the training data for the PaTaRM series, a collection for aligning language models. The dataset contains two subsets: one with 35.6k samples for supervised fine-tuning (SFT) and another with 41.7k samples for reinforcement learning (RL). It was created by AIJian and last updated on April 1, 2026.
Xinyu Dou's meta-analysis aggregates results from 17 empirical studies examining the relationship between family socioeconomic status (FSES) and offspring creativity. The dataset, updated in March 2026, provides a structured synthesis of research findings in a 13.9 KB Excel file.
A dataset for predicting customer churn using machine learning techniques. The dataset originates from Kaggle, but its author, size, and specific contents are unspecified. Its last update date is unknown.
The Learning Agency Tools Competition Submissions dataset is hosted on Kaggle. The dataset likely contains entries from an educational technology competition. Specific details on the number of submissions, columns, and time period are unavailable.
890 Coursera course records scraped from the official website by an individual learner for a hackathon project. The dataset includes course titles, organizations, certificate types, ratings, difficulty levels, and student enrollment counts. The author shared the scraping code and an article about the dataset generation process.
6,864 zooplankton samples were collected over twelve years from 1977 to 1988 in the Gulf of Maine. This dataset, part of the NOAA NEFSC MARMAP program, supports analysis of spatial and temporal patterns in marine ecosystems. It was compiled for a biogeographic assessment of the Stellwagen Bank National Marine Sanctuary.
6,864 zooplankton samples were collected over twelve years from 1977 to 1988 in the Gulf of Maine. This dataset originates from the NOAA Northeast Fisheries Science Center's MARMAP program and was used for a biogeographic assessment of Stellwagen Bank National Marine Sanctuary. The data is available on multiple government data platforms.
Alwaha School data published on Kaggle. The dataset likely contains information related to student enrollment, academic performance, or administrative records. Its specific contents, scale, and origin require verification after download.
Marine reporting units for the Netherlands derived from the Marine Strategy Framework Directive. The dataset is used by the Netherlands to carry out assessments under Articles 8, 9, and 10 of the MSFD. It is published by the Dutch Ministry of the Interior and Kingdom Relations under a CC0-1.0 license.
IGAD Climate Prediction and Applications Center (ICPAC) provides a spatiotemporally continuous long-term Aridity Index dataset for Somalia. The index is an effective estimator of drought status. The dataset was last updated on 2026-03-17 and is available in GeoTIFF format under a CC-BY-4.0 license.
A dataset on student learning behavior, published on Kaggle. The specific number of records, features, and collection methodology are not detailed in the available metadata. Its content likely contains observations related to student activities and performance.
Facial emotion data likely intended for deep learning model training. The dataset is hosted on Kaggle, but its scale, collection method, and specific contents are not detailed in the available metadata. Its creation date and authorship are unknown.
Indicators for measurement and assessment of model fit quality, compiled by Guifeng Zheng. The dataset is a 9.5 KB XLS file last updated on March 25, 2026. It is licensed under CC-BY-4.0 and hosted on figshare.
Survey data from 599 secondary school students in Adigrat Town, Ethiopia, assessing behavioral characteristics and mental health factors. The dataset was created by Haftom Tesfay Gebremedhin and published in 2024.
A 9.5 KB Excel file containing classification results for deep learning models, authored by Zahra Ahani. The dataset was last updated on March 25, 2026, and is shared under a CC-BY-4.0 license on figshare.
Managerial records for creating employee performance plans, conducting performance discussions, and finalizing appraisals within the U.S. Social Security Administration. The dataset was last updated on April 3, 2026. The data is published under an 'other-license-specified' license.
Whole of Afghanistan Assessment 2023 provides insights for the Humanitarian Needs and Response Plan process. The annual assessment enables longitudinal analysis of needs and severity across population groups and geographical areas. It is produced by the REACH Initiative.