Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,903 datasets
A sample of 294 children, predominantly male (68.7%) with the largest age subgroup being 7–9 years (33.7%), was recruited for this study. The research, authored by Yanyan Lu and last updated in May 2026, uses Structural Equation Modeling to examine the direct effects of musical engagement on rehabilitation performance and the mediating role of non-cognitive skills. It provides empirical evidence for music-based interventions in contexts like ASD, ADHD, and intellectual disabilities.
LBA-ECO CD-10 reports vertical profiles of H2O vapor concentrations measured at the Para Western (Santarem) km 67 Primary Forest Tower Site in Brazil's Tapajos National Forest. Measurements were taken at eight heights from 0.91 to 62.2 meters from January 2002 through January 2006, with data averaged hourly. The dataset includes co-located eddy flux measurements of CO2 and H2O and a set of meteorological parameters.
Terrestrial Ecosystem Model (TEM) calibration data provides carbon and nitrogen pool sizes and fluxes for 16 globally distributed field sites representing biomes from tundra to tropical forest. The dataset, originally published in 1999 and maintained by the ORNL DAAC, was compiled from literature to calibrate a process-based model for estimating continental-scale biogeochemical cycles. Data files remain unchanged since original publication, with only documentation updates noted in 2026.
A dataset from an in-lab study of 21 adult participants with mild to moderate sensorineural hearing loss evaluating an Over-the-Counter hearing device. The study assessed speech recognition in noise, subjective listening effort, and real ear measures of the Nuance Audio Glasses. The data was authored by Paula Folkeard and last updated on May 25, 2026.
A 2026 meta-analysis by Jiayi Gu synthesizes evidence from 7 controlled studies involving 296 patients. The dataset contains statistical results comparing music intervention to control conditions on consciousness levels in patients with disorders of consciousness. It was published on figshare under a CC-BY-4.0 license.
Jiayi Gu's 2026 meta-analysis document on figshare reviews evidence from controlled studies on music intervention for patients with disorders of consciousness. The analysis includes seven studies comprising 296 patients, finding a significant improvement in consciousness levels with high heterogeneity. The document is a 2.6 MB DOCX file licensed under CC-BY-4.0.
A meta-analysis document reviewing seven controlled studies on the effect of music intervention on arousal promotion in patients with disorders of consciousness. The analysis includes data from 296 patients across four randomized controlled trials and three non-randomized studies. The document was authored by Jiayi Gu and last updated in May 2026.
53 adult male participants (15 with Beckers MD, 12 with Limb-Girdle MD, 12 with Fascioscapulohumeral MD, and 14 non-MD controls) completed assessments of sleep, body composition, quality of life, pain, fatigue, and function. The dataset includes objective 7-day 24-hour tri-axial accelerometer data and subjective sleep quality indices. Christopher Morse authored this dataset, last updated on 2026-05-28.
Jessica MacLean's 2026 study tests how auditory rhythmic cues affect speech perception by manipulating periodicity, predictability, rate, and phase. The dataset likely contains experimental results from a psychoacoustic task where participants identified concurrent vowel pairs preceded by rhythmic woodblock cues. It includes measures of identification accuracy, reaction times, and individual differences in musical pulse continuation and speech-in-noise perception.
20.4 KB of strontium isotope ratio and concentration measurements on cremated human remains from Roman and pre-Roman Nijmegen, Netherlands. The dataset, authored by Maura De Coster and last updated in May 2026, compares samples from rural, urban, and military contexts to investigate dietary practices and local interaction.
120,125 audio files totaling 143.88 hours comprise this dataset for Bangla speech analysis. BAS4R contains both authentic and spoofed speech from 200 native speakers across ten Bangladeshi districts. Al Arian Ahmad contributed this dataset to Harvard Dataverse, with a last update recorded on 2026-05-22.
A study by Steve Garwood analyzed 50 automated speech recognition (ASR) transcripts from lecture capture recordings. The analysis used word error rate (WER) to assess accuracy and identified factors affecting performance. Only 19 of 50 (38%) transcripts met the 99% accuracy standard required by the Americans with Disabilities Act (ADA).
A clinical trial dataset from a prospective 2x2 factorial-design study involving 120 patients with Grade II/III internal hemorrhoids. The study evaluated the individual and combined effects of Kangfuxin solution perianal warm moist compress and music therapy on postoperative anal fullness, symptom duration, and quality of life. The dataset was published by Fen Gao on figshare in May 2026 under a CC-BY-4.0 license.
AirMOSS P-band synthetic aperture radar data captures polarimetric backscattering coefficients (sigma-0) to estimate root-zone soil moisture. The dataset contains biannual flight campaign data from 2012 to 2015 across 10 North American study sites, including the Harvard Forest. It was collected by NASA's airborne instrument to investigate links between soil moisture variability and continental-scale carbon fluxes.
World Bank Group data from 2016 onward aims to address persistent gender data gaps. This dataset hosts the latest sex-disaggregated statistics covering demography, education, health, economic opportunities, public life, and agency for St. Kitts and Nevis. It is part of a broader initiative to provide more and better gender data for smart development policy.
130 participants rated 120 environmental sounds on valence and arousal and completed well-being questionnaires. The study included 40 hearing aid users and 90 individuals with self-reported normal hearing, the latter split into subgroups with no-to-minimal and some hearing difficulties. Dina Lelic authored this dataset, last updated on 2026-05-25.
Alexander Paul Burton's dataset documents a live, unedited rubato piano improvisation in F Sharp Dorian. The 1.1 GB release includes multiple file formats such as MP3, CSV, and XLSX, capturing a high-density performance across the C3 to C6 range. It is released under a CC-BY-4.0 license as open-access anti-algorithm training data.
Memo2496 is a music emotion recognition dataset containing 2,496 instrumental songs annotated by 30 music experts. The dataset provides valence-arousal labels and extracted acoustic features to support affective computing research. It was updated by Qilin Li in April 2026 and is hosted on figshare under a CC-BY-4.0 license.
U–Pb dating of zircons from middle Permian tuffs in the Canning Basin reveals a conflict with established spore-pollen zonation. The dataset includes an age of 267.04 ± 0.14 Ma from the M. villosa Zone, which is 1.7 million years younger than tuffs from the D. granulata Zone. This research was published by Mory et al. in the Australian Journal of Earth Sciences in 2017.
July 2013 to December 2014 data provides gridded footprint fields from the WRF-STILT Lagrangian particle dispersion model for two receptor sites in Boston, MA. The 1-km resolution footprints quantify the influence of upwind surface fluxes on measured CO2 and CH4 concentrations. This dataset is produced by the National Aeronautics and Space Administration using meteorological fields from WRF version 3.5.1.