Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,062 datasets
2024-2025 school year progress report ratings for all Chicago Public Schools, based on data from the 2023-2024 academic year. The dataset includes metrics on standardized test performance, school surveys, and award status. It was published by the City of Chicago and is updated annually.
A list of contractor licenses issued for registry assessment and remediation within New York State. The dataset includes business details, license status, and compliance flags such as debarment and wage assessments. It is provided by data.ny.gov and was last updated on April 3, 2026.
9,000 examples for training large language models to act like professional music producers in Ableton Live 12 Suite. The dataset was created by author gss1147 and was last updated on May 2, 2026. It covers topics such as drum programming, bass sound design, mixing, mastering, and live performance setup.
Data.calgary.ca provides location and administrative information for public schools and post-secondary institutions in Calgary. The dataset includes columns for school name, address, phone, grade levels, and administrative boundaries. It is used to power the public Calgary Education map application.
5,000 original synthetic chat examples are designed to train a Pokemon-world roleplay assistant. The dataset, created by clarkkitchen22 and last updated in May 2026, teaches immersive scene construction and character-aware replies without copying existing transcripts. It also includes rejected candidates and preference pairs for advanced training techniques like DPO.
A 119.4 MB dataset from figshare, last updated March 22, 2026, by author Sudipta Paul. It contains data for developing a machine learning interatomic potential covering the entire compositional space of NaCl-UCl3 molten salt. The dataset is intended to enable accurate, low-cost calculation of thermophysical properties like density, viscosity, and thermal conductivity.
Murat Polat's 2026 study conceptualizes managerial literacy as an integrated leadership capacity for navigating accountability-driven and data-informed education systems. The dataset is 5.3 MB in size and shared under a CC-BY-4.0 license on figshare. Its specific contents are described as supporting the study's conceptual framework.
UNHCR conducted a participatory assessment in July-August 2021 to map protection gaps for refugees and asylum-seekers in Syria. The assessment involved 80 focus group discussions across 11 governorates, with participation from 712 persons of concern. Refugees identified key challenges and provided recommendations to UNHCR.
A quarterly survey series monitors changes in refugee vulnerability in Jordan throughout 2022. UNHCR collected information repeatedly from the same refugee families to examine household-level variations in economic vulnerability, food security, shelter, WASH, and health. The assessment expanded in Q3 2022 to include refugees from the Azraq and Zaatari Syrian refugee camps.
CTD data from nine R/V WECOMA cruises along the northwest Pacific coastline, collected by Adriana Huyer of Oregon State University for the Slope Undercurrent Study. The dataset provides high-resolution vertical profiles of temperature, salinity, density, and other oceanographic parameters. Data is processed to the NODC standard High-Resolution CTD/STD (F022) format.
Part of the VMREFTAB reference tables for the VICMAP suite of products, this dataset is published by the Department of Transport and Planning. It was last updated on 2026-04-09 and is available under a CC-BY-4.0 license. The data likely contains property assessment information for the state of Victoria, Australia.
Massachusetts Department of Elementary and Secondary Education (DESE) publishes cohort graduation outcomes for public schools. The data tracks percentages of students graduating within 4 or 5 years, dropping out, attaining a GED, or being excluded. It is sourced from the educationtocareer.data.mass.gov platform and was last updated on March 6, 2026.
Norwegian lower secondary school pupils (n=398) from three municipalities in Telemark, Nordland, and Vestland counties participated in a language classification experiment. The dataset contains quantitative task data on accuracy and reaction times for classifying standard vs. dialectal Norwegian sentences, plus qualitative background questionnaire data from a subset of 352 participants. Anya Vinichenko contributed this replication data, which was last updated on 2026-04-21.
Sampling and results information for lead testing in school drinking water, reported by each NYS public school and Boards of Cooperative Educational Services (BOCES) for Compliance Period 2023-2025. The data is mandated by Public Health Law Section 1110 and NYS Department of Health regulation 10 NYCRR 67-4. It was last updated on February 26, 2026.
Educational materials for speech and language acquisition in autism, created by LSL-datasets. The dataset contains image and video samples of action verbs and verb+noun pairs. It is designed to support machine learning tasks related to visual understanding and language grounding.
Data protection impact assessments (DPIAs) published by the London Borough of Camden to identify and mitigate privacy risks in data collection, use, storage, and disclosure. The dataset is published in accordance with the Council's Data Charter and the GDPR/Data Protection Act 2018. Its last recorded update was on 2026-04-23.
Quantitative PCR data from a study examining transcript level alterations in placental cytokines and chemokines following prenatal dexamethasone exposure. Noriko Nakamura published this dataset on figshare in April 2026 under a CC-BY-4.0 license. The data was generated from humanized mice injected with dexamethasone or saline on gestation days 10–14, with placentae collected on day 18.
Viviana Galarza's 135.8 KB PDF file is a PRISMA-based screening matrix for a systematic literature review. The review focuses on the integration and use of generative artificial intelligence in university teaching within Ibero-America. The dataset was last updated on April 21, 2026.
Estimates of significant wave height and period, together with tidal current speed over a semi-lunar cycle, were used to predict areas on the Australian continental shelf where unconsolidated sediment was mobilised. These sediment-entraining processes were examined independently to quantify their relative importance. The dataset originates from the Australian Ocean Data Network.
A geospatial dataset mapping flood impact from a mudflow event observed on 14 August 2025 in East Sao Vicente Municipality, Cabo Verde. The analysis covers approximately 35 km², identifying about 3 km² of affected land, 1,000 impacted buildings, and 15 km of affected roads. It was produced by the United Nations Satellite Centre (UNOSAT) using Pleiades very high-resolution satellite imagery.