Loading...
Loading...
Speech recognition, text-to-speech, speaker identification, music classification, audio event detection
1,907 datasets
2007 measurements collected along the Massachusetts and Maine coastal regions. The dataset contains oceanographic data, including chemistry, optics, temperature, and salinity/density, produced by the National Aeronautics and Space Administration. It is available in BIN and ISO file formats.
somu9's Hindi Tokens dataset contains 305,847 pre-extracted audio codec tokens for text-to-speech training. The data comprises 544.2 hours of Hindi audio, with an average sample duration of 6.4 seconds. It was last updated on June 2, 2026.
U–Pb dating of zircons from middle Permian tuffs in the Canning Basin reveals an apparent 1.7-million-year conflict with established spore-pollen zonation. The dataset, published in the Australian Journal of Earth Sciences in 2017, includes CA-IDTIMS ages and palynological zone information from core holes spanning 350–400 km. It highlights a potential local environmental influence on fossil assemblages and cautions against direct facies comparison.
Atmospheric measurements provide hourly averages of methane (CH4) from five sites and five-minute averages of methane and ethane (C2H6) from one urban site in Boston, Massachusetts. Data collection occurred from September 2012 to May 2020 using Picarro cavity ring down spectrometers and a laser spectrometer. The dataset was produced by ORNL_CLOUD, with background concentrations modeled using HYSPLIT trajectories and NAM meteorology.
Persian (Farsi) synthetic multi-speaker speech dataset for speaker diarization. It contains approximately 50 hours of audio across 5,000 tracks, built from utterances in the Shemo dataset and processed through a synthesis framework. The dataset was created by atiyehghm and last updated on Hugging Face in May 2026.
arg-spanish-tts is a unified, deduplicated speech corpus for Argentine Spanish (es-AR) containing 10,747 audio rows. The dataset was created by Kukedlc, who merged three public datasets and stripped cross-source duplicates. All audio is resampled to 24 kHz mono, totaling 12.18 hours from 65 unique speakers.
EvA Open Data provides audio clips paired with descriptive captions and instruction-based question-answer data. The audio is sourced from the AudioSet Strong Labels dataset and stored in parquet shards. The dataset was authored by SatsukiVie and last updated on Hugging Face in May 2026.
DNS5 Challenge data is a mirrored collection of audio files for speech enhancement tasks. It contains 245 hours of English, 95 hours of French, and 137 hours of German speech sourced from LibriVox, AudioSet, Freesound, OpenSLR26, and OpenSLR28. The dataset was converted to Opus format by user philgzl and last updated in May 2026.
ORNL_CLOUD provides the Site Averaged Flux Data: 1987 (Betts) Data Set from the 1987-1989 FIFE experiment. This dataset contains site-averaged product data collected by multiple principal investigators, structured in 30-minute time intervals for 1987 and covering the entire 1987-1989 period. The data is available in multiple file formats including HTML, PDF, PNG, BIN, ISO, ZIP, and TEXT.
An original dataset documents the permitting processes for locally-permitted wind and solar energy projects in Massachusetts. Created by Natalie Baillargeon, it contains data on permitting durations, project outcomes, and capacity. The dataset was last updated on April 8, 2026, and is shared under a CC-BY-4.0 license.
ChildTalk is a large-scale, publicly available multi-dialect Chinese child speech dialogue dataset. It addresses limitations in existing corpora, such as small size and lack of natural conversations, by providing full-length dialogue recordings. The dataset was created by yujie-ovo and was last updated on May 29, 2026.
SonoroNova-ES is a large-scale synthetic English-to-Spanish speech-to-speech translation dataset containing 329,764 utterances. It was constructed via cascade pipelines combining text-to-text translation models with neural text-to-speech engines, using source audio derived from the HiFiTTS-2 English audiobook corpus. The dataset features 1,315 unique speakers and provides a total of 961 hours of audio.
A Ukrainian-language speech dataset parsed from the Телебачення Торонто YouTube channel. Each sample consists of a short audio clip paired with its corresponding Ukrainian subtitle text, intended for automatic speech recognition research and education. The dataset was created by yuriilaba and was last updated on Hugging Face in May 2026.
Western Australia's Canning Basin provides data on apparent age conflicts in middle Permian stratigraphy. The dataset likely contains U-Pb zircon dates from tuffs and associated palynological zone information, published in a 2017 study by Mory et al. in the Australian Journal of Earth Sciences. It documents a 1.7-million-year discrepancy between CA-IDTIMS dates and established spore-pollen zonation.
Kansas, USA hosts this site-averaged dataset from Portable Automatic Meteorological Stations deployed during the 1987-1989 FIFE experiment. It contains 30-minute interval measurements of atmospheric and surface conditions. The dataset is provided by the National Aeronautics and Space Administration.
Voices in the Wild 2M is an automatic speech recognition dataset designed for robustness training and evaluation. The dataset contains audio files grouped by normalized acoustic subset, with fields for file paths and reference transcriptions. It was created by author zhifeixie and last updated on Hugging Face in May 2026.
1987 to 1989 data provides site-averaged daily neutron probe soil moisture measurements collected during the FIFE experiment. The dataset is a processed product where samples were averaged first by site and then by day. It is hosted by the ORNL_CLOUD organization.
1987 data from the FIFE experiment provides site-averaged daily neutron probe soil moisture measurements. The dataset contains product data where samples were averaged first for each site and then for each day. It is managed by ORNL_CLOUD and originates from a field campaign conducted from 1987 to 1989.
Kansas site-averaged gravimetric soil moisture data was collected during the 1987-1989 FIFE field campaign. This dataset contains only the 1988 product, where samples were averaged first by site and then by day. The data is managed by the ORNL_CLOUD organization.
Site Averaged Gravimetric Soil Moisture Data from the 1987 (Betts) dataset provides daily averages of soil water content collected during the 1987-1989 FIFE field campaign. The data represents site-averaged product samples from 1987 only. It is managed by the ORNL_CLOUD organization.