Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
13,101 datasets
Modelled fluvial flood depth data was created for the 1% annual chance of flooding situations as a by-product from the 2004 generalised modelling project. The Environment Agency produced this data using the JFlow two-dimensional hydrodynamic model on a 5x5m grid to fill gaps where no detailed local modelled data existed in 2004. This metadata record is for the AfA238 Flood Zone Depth Grid Dataset 2004, covering the Ordnance Survey National Grid reference SX.
Modelled fluvial flood depth data was created for the 1% annual chance of flooding situation in 2004 using the JFlow hydrodynamic model on a 5x5m grid. The Environment Agency produced this data to fill gaps in detailed local modelling for spatial planning flood zone definitions. The data covers the whole of England, but this specific download is for the Ordnance Survey National Grid reference TF.
Supplementary material and research data for a meta-ethnography review study on parental alienation. The underlying analysis synthesized ten qualitative studies, resulting in four themes about long-term psychological impacts. The dataset was authored by Ingrid รlfarnes Rรธysland and last updated on 2026-04-21.
Parcel records from Cook County, Illinois, include distances and counts of nearby spatial features like transit stops, schools, and foreclosures. The dataset is produced by the Cook County Assessor's Office and is updated annually with the current tax year's data. It is based on parcel centroids derived from official county shapefiles.
Structured curriculum data from secondary chemistry and university biochemistry programs, organized into analytical units and knowledge domains. The dataset is 67.2 KB in size, authored by Aleksandra Mikhailidi, and was last updated on 2026-04-09. It enables domain-level comparison and identification of alignment patterns between educational levels.
Raw experimental data supporting the research 'Explainable machine learning reveals a RELB-driven oncogenic and inflammatory network orchestrating.' The dataset, authored by jiaying Dai, is a 188.9 MB ZIP file published on figshare under a CC-BY-4.0 license and last updated on April 23, 2026.
Administrative data from municipal service records matched with apartment-level housing price data in downtown Shanghai. The dataset supports analysis of the relationship between neighborhood wealth and the inclusion of strategic misinformation in citizen petitions. It was authored by YU, CHITAO and last updated on 2026-04-23.
3.8 MB of detailed experimental results supporting the paper 'A meta learning and task adaptive approach for drug target affinity prediction'. The data, authored by Mengxuan Wan, is stored in an XLSX file and was last updated on 2026-04-23. It is shared under a CC-BY-4.0 license.
Replication data for a study on teacher training in Pakistan's public sector, authored by Zahra Mansoor. The dataset is hosted on the Economic Development and Cultural Change Dataverse and was last updated in June 2026.
Proteomic data from human liver cells, intended for drug activity profiling using machine learning. The dataset, authored by Shaon Basu, consists of 685.7 MB of DIA-MS input files in TSV and XLSX formats and was last updated on April 14, 2026.
A 2026 dataset from Central Highlands Water details recycled water fireplug locations within its service area. The data is provided in multiple geospatial formats including KML, GeoJSON, and CSV.
MCD43A2 Version 6.1 provides daily quality information for BRDF and albedo retrievals from the MODIS instruments on NASA's Terra and Aqua satellites. The dataset is produced by the LPCLOUD organization using 16 days of data temporally weighted to the ninth day of the retrieval period. It contains band-specific quality flags and ancillary data for MODIS bands 1 through 7.
Daily global data from 2000 to present provides the three model weighting parameters (isotropic, volumetric, and geometric) for deriving surface albedo and reflectance. The dataset is produced by NASA's LPCLOUD using 16 days of combined Terra and Aqua MODIS observations, temporally weighted to the ninth day of the retrieval period. It covers MODIS spectral bands 1-7 plus visible, NIR, and shortwave bands with associated quality layers.
Adult reproductive data for redbait (Emmelichthys nitidus) collected via midwater trawling during the spawning season. The dataset includes biological information on size, sex, and reproductive condition, with histological examination of ovaries providing spawning activity and batch fecundity details. It is hosted by the Australian Ocean Data Network and was last updated on 2026-04-10.
A 2023 gear trial from Heriot Watt University's Lyell Centre compared commercial and experimental whelk pot designs in the English Channel. All captured whelk (Buccinum undatum) were measured for total shell length using calipers. The data likely contains paired catch records from control and test pots with different escape gap designs.
A national geospatial dataset and mapping tool developed by the University of Wisconsin, Madison School of Medicine and Public Health to measure socioeconomic disadvantage at the neighborhood level across the U.S. It provides the Area Deprivation Index (ADI), a composite measure combining 17 indicators across income, education, employment, and housing quality domains. The dataset offers ranked scores, including national percentiles and state deciles, for neighborhood comparison.
Training datasets for the OmniVL-Guard safety guard model, accepted at ICML 2026. The collection includes multi-modal training data (22.1 GB), multi-modal reinforcement learning data (27.3 GB), and refined supervised fine-tuning examples (105 MB). The dataset was uploaded by SJJ0854 and last updated on May 11, 2026.
10,372 examples form a benchmark for generating programmatic teaching videos using the Manim library. The dataset contains 5,199 English and 5,173 Chinese examples, each pairing a natural-language instruction with a reference Manim answer. It was created by 'posprivacy' and last updated on 2026-05-03.
306 survey responses from Vietnamese adults form the empirical basis for a structural equation model analyzing financial behavior. The model integrates variables like family financial socialization, artificial intelligence, financial literacy, and digital trust. It was developed by Nguyen Quoc Anh to assess direct, mediating, and moderating effects on financial well-being.
Nguyen Quoc Anh's dataset contains survey results from 306 Vietnamese adults, used to test an integrated financial behavior model. It applies Partial Least Squares Structural Equation Modelling to assess direct, mediating, and moderating effects. The data examines relationships between financial socialization, technological factors, financial capability, financial behavior, and financial well-being.