Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,269 datasets
Mobile phone data from Kaggle, curated by the TabArena team for a study on independent and identically distributed tabular data. The dataset is intended for classification tasks and originates from a 2018 Kaggle competition. The target variable values were renamed by the curators to be more descriptive.
TabArena curated this dataset for evaluating predictive models on independent and identically distributed tabular data. The intended task is binary classification, likely to predict customer quality. The original data was sourced from a 2020 Kaggle dataset by user Podsyp.
Kaggle user Arashnic curated this dataset in 2021 to predict whether data scientists will seek new employment. The TabArena team subsequently processed it for a study on evaluating predictive models for independent and identically distributed tabular data. Its intended task is classification.
A 2014 dataset curated by the TabArena team for evaluating predictive machine learning models on independent and identically distributed tabular data. It originates from a study on algorithm selection for Answer Set Programming solvers, with the intended task being classification. The dataset was sourced from OpenML and is associated with published research.
2400 instances of scattering parameters measured at 10.5 GHz to detect 10 types of contaminants in packaged food jars. The dataset, created by Luca Urbinati, is part of a series of five datasets each measured at a different microwave frequency. It contains 1200 uncontaminated and 1200 contaminated samples, with contaminants including metal, glass, and plastic objects placed at the surface or middle of the spread.
2400 instances of scattering parameters measured at 10.0 GHz to detect contaminants in packaged food jars. The dataset includes 1200 uncontaminated and 1200 contaminated samples across 11 classes of contaminants like metal, glass, and plastic. It was created by Luca Urbinati as part of a series of five datasets measured at different microwave frequencies.
2400 instances of scattering parameters measured at 9.5 GHz for detecting contaminants in packaged food jars. The dataset was created by Luca Urbinati and contains 1200 uncontaminated and 1200 contaminated samples across 11 classes of contaminants like metal, glass, and plastic. It is part of a series of five datasets each measured at a different microwave frequency.
2400 instances of scattering parameter measurements for detecting contaminants in packaged food jars. The dataset was created by Luca Urbinati using microwave sensing at 9.0 GHz and contains 30 attributes per sample. It includes 1200 uncontaminated and 1200 contaminated instances across 11 different contaminant classes.
Geoscience Australia Data provides a collection of papers published for the inaugural Great Barrier Reef Conference held at James Cook University in August-September 1983. The dataset is a legacy product with no abstract available, and its specific content and structure are not detailed. It was last updated on the data.gov.au platform on 2026-04-30.
200 CSV files contain numerical facial movement data for Parkinson's disease assessment, processed from raw video footage using OpenFace 2.0. The dataset, created by Jaratsaeng, Sitthatka, was last updated on April 22, 2026. Each file contains 714 columns of frame numbers, timestamps, confidence scores, and facial landmark coordinates.
A 4.6 MB MATLAB file containing example data for XTCAV measurements, authored by River Robles. The dataset was last updated on April 11, 2026, and is shared under a CC-BY-4.0 license. The specific number of rows and column definitions are not provided in the metadata.
The Agulhas-South Atlantic Thermohaline Experiment (ASTTEX) dataset provides measurements of heat, salt, and mass fluxes entering the South Atlantic Ocean via the Agulhas Retroflection. It includes hydrographic profile data (CTDO, XBT, XCTD) and bottle samples for salinity and dissolved oxygen collected during a 2003 research cruise. The experiment was conducted by researchers including Dr. Donna Witter of Kent State University and supported by NOAA.
The APOLLO LUNAR SAMPLES BUG OBSERVATIONS V1.0 dataset contains bidirectional reflectance distribution function measurements of Apollo lunar samples. The data was collected using the Bloomsburg University Goniometer (BUG) instrument and is provided by the National Aeronautics and Space Administration. The dataset was last updated on March 13, 2026.
Norfolk city real estate assessment and sales data for fiscal year 2024, provided by the city's Office of the Real Estate Assessor. It includes details on property characteristics, ownership, and valuation for parcels within the city. The dataset is updated daily on weekdays.
Annual data of school boards for MBO (vocational education) in the Netherlands. The dataset is published by the Ministry of the Interior and Kingdom Relations on the EU Open Data portal. The data is provided in an Excel (XLS) file format under a CC0-1.0 public domain license.
Catherine Park's 5.5 KB dataset on figshare compares the predictive performance of various machine learning models for suicide attempts. It was last updated in April 2026. The dataset is stored in an XLS file format.
Scar characteristics and assessment methods from studies on Platelet-Rich Plasma (PRP) therapy. The dataset is a 21.5 KB Excel file created by Virgilio BlandΓ³n and last updated in April 2026. It compiles information on scar evaluation techniques used in clinical research.
Survey data from 165 young adults engaged with a college success initiative in Chicago, used to examine overlapping forms of marginalization in further and higher education. The data addresses relationships between caregiving, economic strain, surveillance, family pressure, and academic outcomes. The study was authored by Kamis, Arnold and last updated on 2026-04-16.
HOI-Edit-44K is a large-scale dataset of 44,117 high-quality, paired examples for Human-Object Interaction (HOI) editing, created to address the scarcity of supervised data for this task. It was authored by jiuntian and released on Hugging Face in April 2026, with associated research presented at CVPR 2026. The dataset is designed to provide the necessary supervision for training models like OneHOI to perform identity-preserving HOI modifications.
Research commissioned by the Greater London Authority in September 2023 investigates the impact of rising living costs on London's adult education sector. The work is based on a survey, interviews, and focus groups with learners, providers, and third-sector organizations conducted between November 2023 and March 2024. It provides an account of financial pressures, noting that GLA data shows 17% of Londoners were 'struggling financially' and 30% were 'just about managing' in 2023.