Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,907 datasets
Data from two follow-up studies to Chandler and Pronin (2012) investigating the effects of movement on thought speed and subsequent risk-taking behavior. The dataset includes raw data from a Fitts' tapping task (FittsBart), a lower limb tapping task (BodyBart), and the Balloon Analogue Risk Task (BART) for measuring pumping behavior, along with PANAS measures. The 5.6 MB dataset was authored by Clare MacMahon and last updated on 2026-05-25.
Cambodian cultural speech data comprising 134.6 hours of manually curated speech-text pairs in the Khmer language. The dataset was created by DDD-Cambodia using eight native speakers and was last updated in May 2026. Recordings average 8.54 seconds in length and include speaker metadata such as gender, age group, and origin city.
37,000 km² of Yukon Territory are underlain by potential coal-bearing rocks from Mississippian to Tertiary periods. This inventory, produced by the Government of Yukon, documents coal occurrences in seven distinct geological areas and includes a 1:2,000,000-scale map. The extent of deposits is largely unknown, as detailed examination has been limited.
A geological reconnaissance report describes the Toobally fault and surrounding rock formations in northern Toobally Lake, Yukon. The report identifies a newly proposed Toobally Formation diamictite estimated at 1800 m thick and an 850-m-thick basalt succession. It was published by the Government of Yukon and last updated in April 2026.
A 2015 dataset from the Queensland government's State of the Environment reporting describes the composition of litter. It notes that cigarette butts are the most common type of litter, even though they make up only a small volume. The dataset was published by the Department of Environment, Tourism, Science and Innovation and is available under a CC-BY-4.0 license.
22 healthy adults participated in a real-time fMRI neurofeedback experiment using a novel musical interface. The dataset includes pre- and post-session questionnaire results assessing mood and subjective experience, alongside neuroimaging data from a 50-minute MRI session. The research was authored by Alexandre Sayal and shared on figshare with a CC-BY-4.0 license.
Nineteen participants with moderate to severe Alzheimer's disease from four nursing homes participated in a single-group intervention study. The data, published in 2026, includes assessments of social engagement, episodic memory, observed emotion, and verbal interactions collected at baseline, during nine workshops, post-intervention, and at a one-month follow-up. The dataset is a 102.8 KB PDF file containing the study's data sheet, authored by Mikael Genguelou.
Nineteen voluntary residents with moderate to severe Alzheimer's disease from four nursing homes participated in a single-group intervention study. The data likely contains assessments of social engagement, episodic memory, and observed emotions collected at baseline, three points during the intervention, post-intervention, and a one-month follow-up. The dataset was authored by Mikael Genguelou and last updated on April 22, 2026.
Approximately 187 hours of Arabic speech recordings and transcripts derived from publicly available lectures, interviews, television appearances, and talks by Dr. Mostafa Mahmoud. The dataset was created by oddadmix to support Arabic speech technology research and development. It was last updated on the platform in June 2026.
A 1-hour sample of labeled speech data for the Ibibio and Efik languages. The data was collected from native speakers in Akwa Ibom, Nigeria by Scuba, a voice data infrastructure company. The sample was posted on Hugging Face on May 19, 2026.
2.28 million unique audio clips are organized into streams for training audio-language models and agents. The dataset is structured into six task-specific subsets, including streams for audio understanding and real-time automatic speech recognition. It was created by zhifeixie and last updated on June 3, 2026.
Plastic items were the most common littered items in Queensland during the 2018–19 period, replacing cigarette butts. The data, provided by the Queensland Department of Environment, Tourism, Science and Innovation, highlights the significant environmental load of cigarette butts despite their small volume. It was last updated on May 12, 2026.
A Persian (Farsi) text-to-speech dataset containing professional single-speaker narration recordings. The dataset was created by amir0907 and was last updated on Hugging Face in May 2026. It is designed for training TTS models.
A 2026 study by Sun Sun Yap on figshare investigates heart-rate synchrony between a music therapist and 11 in-patients during neurorehabilitation sessions. The dataset includes dyadic heart-rate data, session videos, and notes, focusing on moments of interest within therapy interventions averaging 25.62 minutes. It explores relationships between physiological synchrony, nonverbal synchrony, and patient therapy readiness.
Approximately 1,200 km of roads are mapped in OpenStreetMap for Saint Kitts and Nevis. This dataset, created by HeiGIT and last updated in March 2026, classifies road surfaces as paved or unpaved using a hybrid deep learning approach that augments OSM data with Mapillary imagery and urban layers.
SMAPVEX19-22 campaign data captures L-band radiometer measurements over a red oak forest at Harvard Forest, Massachusetts, from late April to mid-October 2019. The dataset includes concurrent in-situ measurements of canopy leaf water potential, dielectric constant, soil moisture, temperature, and tree xylem properties. Its primary goal is to study the sensitivity of L-band vegetation optical depth (VOD) to changes in vegetation water potential over a growing season.
FactShield is a dataset containing experimental artifacts for a preliminary evaluation of a modular pipeline for automatic claim verification in audiovisual content. The dataset was authored by Fabiann Barbosa and last updated on May 23, 2026. It is a small dataset of 35.6 KB, available in ZIP and XLSX formats under a CC-BY-4.0 license.
A clinical study of 51 native Mandarin-speaking cochlear implant users, testing speech perception across five audio processor configurations. The dataset includes monosyllabic word, disyllabic word, and sentence recognition scores in quiet and noise conditions. The research was authored by Kailong Yin and published on figshare in April 2026.
Stuart D. Blacksell's dataset on figshare summarizes key scientific contributions and public health impact from long-term rickettsial research in Asia. The data is stored in an XLS file of 9.5 KB and was last updated on 2026-05-26. The dataset is licensed under CC-BY-4.0.
DatarrX created a Burmese Synthetic Speech Corpus designed to advance Text-to-Speech systems and speech recognition for the Burmese language. The dataset is described as high-fidelity and manually curated to provide natural, native-sounding audio. It was last updated on 2026-05-31.