Loading...
Loading...
Student performance, MOOC logs, knowledge tracing, standardized tests, learning analytics
12,457 datasets
21,459 stroke patient records, including 936 who developed post-stroke epilepsy, form a retrospective cohort for a machine learning study. The research developed a dual-tier screening framework to address severe class imbalance, achieving a model sensitivity of 0.907 and specificity of 0.998. Author Lijun Wu published the study on figshare in May 2026.
A retrospective cohort study of 21,459 stroke patients, including 936 who developed post-stroke epilepsy, used to develop an interpretable machine-learning framework. The dataset was created by Lijun Wu and last updated on 2026-05-21. It was designed to address severe class imbalance for clinical screening.
21,459 stroke patient records, including 936 who developed post-stroke epilepsy, were used to develop interpretable machine learning models. The dataset, created by Lijun Wu and last updated in May 2026, describes a dual-tier screening framework for a severely imbalanced clinical cohort. The primary model achieved a macro-AUC of 0.996, while a secondary alert model prioritized sensitivity.
Geoscience Australia collected underwater footage from 49 stations during a marine survey of the Leveque Shelf in May 2013. The survey aimed to assess the CO2 storage potential of the Browse Basin by looking for evidence of gas or fluid seepage and mapping seabed habitats. Data includes AVI video files, still images, and USBL navigation files providing location, time, and depth for each transect.
Panel data from 281 Chinese cities between 2005 and 2022, used to analyze the impact of Green Industrial Parks on urban industrial chain resilience. The study by Yihao Wang applies a double machine learning approach to a quasi-natural experiment. The dataset was last updated on 2026-06-01 and is shared under a CC-BY-4.0 license.
The Western Margin survey (GA survey #2476) collected geological, geophysical, oceanographic, and biological data from Australia's western continental margin between 25 October 2008 and 19 January 2009. A total of 44 video transects and 6,229 still photographs were acquired from water depths ranging from 831 to 4,827 meters. The voyage was conducted on the R.V. Sonne in collaboration with the Western Australia Geological Survey and the University of the Sea.
A pilot study conducted at Taishan University in China from September 2024 to January 2025. The research examined the feasibility and outcomes of a Dao Yin-based course for 38 psychologically vulnerable undergraduates. Quantitative data includes pre-post scores on the Symptom Checklist-90 (SCL-90) and perceived physical status.
1,481 individuals were assessed for visual acuity using the SightConnect mobile application at the LV Prasad Eye Institute. The dataset likely contains measurements comparing the app's near and distance acuity results against standard clinical assessments, with a mean difference of 0.08-0.09 logMAR. Authored by Payal Sangani and last updated in May 2026, this research supports the use of digital tools for remote eye care triage.
A literature-derived dataset of 1007 cases supports an integrated machine-learning framework for environmental remediation. The data encompasses 80 iron-based materials, 136 target pollutants, and 50 test organisms. Author Qi Chen developed this framework to quantitatively co-assess the performance and safety of remediation materials.
A qualitative research dataset explores resilience factors among teachers supporting learners with learning barriers. The study used a phenomenological design with purposive sampling of 10 teachers from a Gauteng Primary School. Data consists of transcribed, semi-structured video interviews analyzed thematically within Michael Ungar's Social Ecology of Resilience Theory framework.
Anonymized survey responses from 221 undergraduate students, postgraduate students, alumni, and lecturers in educational technology at a large university in Vietnam. Data were collected via an online questionnaire in April 2026 to develop and validate a measurement framework for AI-generated educational video quality. The dataset includes 76 variables covering demographics, prior experience, and Likert-scale responses across eight quality dimensions.
Building Complex Points - NSW Features of Interest Category - GDA2020 Service is a point feature class defining groups of buildings and associated facilities functioning as a unit. The dataset is provided by Spatial Services (DCS) and was last updated on 2026-05-18. It includes themes such as Community Facility and Education Facility, with examples like ambulance stations, libraries, and schools.
A point feature dataset of pre-school facilities in New South Wales, Australia, positioned within their cadastral parcels. The data is part of the NSW Features of Interest Category and has been updated to the GDA2020 national spatial standard. It was initially published on 29/09/2021 by Spatial Services, a business unit of the Department of Customer Service NSW.
A dataset from a study of 266 Thai secondary school students aged 13β19, examining relationships between Chinese media content preferences, perceived usefulness, and self-perceived learning effectiveness on YouTube. The data was collected by Binle Lai and last updated on 2026-05-19. It is a small dataset of 18.3 KB, stored in a DOCX file.
Eat Breathe Thrive for Co-Occurring Eating Disorder and Trauma-related Stress Symptoms: A Randomized Controlled Trial dataset evaluates the efficacy of the Eat Breathe Thrive program with community participants. The dataset is 91.5 KB in size and was last updated on 2026-05-17. Authors from the University at Buffalo, Veterans Affairs Palo Alto Healthcare System, Harvard Medical School, and the Eat Breath Thrive Foundation for Eating Disorders share these materials for scholarly review and transparency.
Ethnographic and eco-sensory data from an ESRC-funded project conducted in a secondary school in Liverpool between March 2021 and February 2023. The 4.7 GB collection includes 360 photographs, LiDAR scans, interviews, flow animations, AR videos, and participatory workshop outputs from 14 Year 12 students. It was created by Laura Trafi-Prats and contains speculative maps, 3D prototypes, and related research artifacts.
213 caregivers of preschool children with Autism Spectrum Disorder were surveyed in Urumqi between December 2023 and October 2024. Song Chen published this dataset to identify factors associated with caregiver psychological distress using LASSO regression and random forest models. The resulting model achieved an AUC of 0.87 on the test set.
Spatial Services provides a point dataset of high schools across New South Wales, Australia, updated to the GDA2020 geodetic standard. The data includes both public and private institutions catering to students aged 12 to 18. Feature positions are mapped within cadastral parcels and were captured from sources at scales ranging from 1:500 to 1:250,000.
724 Industrial Engineering student survey responses from public and private universities in Ciudad JuΓ‘rez, Mexico, collected for a 2026 doctoral thesis. The data, provided by Luis Fernando Alvarez Saucedo, operationalizes the Digital Matthew Effect to examine digital inequalities, gender gaps, and academic AI appropriation. It includes raw data, codebooks, and R scripts for analysis.
219 public places across six wards in Purbakhola Rural Municipality, Palpa, Nepal, were observed for compliance with smoke-free laws from May 7 to May 16, 2025. The dataset, created by Bhakta Bahadur KC and shared under a CC-BY-4.0 license, measures compliance using six indicators including active smoking, signage, and ashtray presence. Compliance was high in institutional settings like health facilities (100.0%) but low in the hospitality sector, with only 45.0% of restaurants free from indoor smoking.