DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Speech & Audio Datasets | DataSalon

Speech & Audio

Speech recognition, text-to-speech, speaker identification, music classification, audio event detection

2,572 datasets

Site Averaged Flux Data from 1988 FIFE Experiment

NASA's 1988 Betts dataset provides site-averaged atmospheric and soil flux measurements from the FIFE field campaign. It contains data aggregated in 30-minute intervals for the single year of 1988. The dataset was collected by multiple principal investigators to study land-atmosphere interactions.

Tabular, Time Series, ZIP, Text, ATMOSPHERIC RADIATION, Heat Flux, LAND SURFACE, Soil Heat Budget, +1 more

Site Averaged Neutron Soil Moisture Data from 1988 FIFE Experiment

Site Averaged Neutron Soil Moisture Data: 1988 (Betts) contains daily site-averaged neutron probe soil moisture measurements from the 1988 season of the 1987-1989 FIFE field campaign. NASA is the organization responsible for this data.

Tabular, ZIP, Text, Fife Experiment, LAND SURFACE, Soil Moisture, Earth Science, +1 more

Conflicting Permian Geochronology and Palynology Ages from the Canning Basin

U-Pb zircon dating from the Canning Basin reveals a 1.7-million-year age conflict between tuffs in non-marine and marginal-marine facies, challenging established spore-pollen zonation for the middle Permian. This dataset likely contains geochemical and stratigraphic data from chemical abrasion-isotope dilution thermal ionisation mass spectrometry (CA-IDTIMS) and argon-argon dating. The findings suggest facies-specific palynofloral influences complicate correlations within the Roadian–Wordian stages.

Tabular, Canning Basin, Geochronology, Biostratigraphy, Palynology, U Pb Dating, Sedimentary Basin, Large Scale, Permian, Permian Period, Uranium Lead Dating, +1 more

Music Training and Reading Skills in Dyslexia from a Randomized Control Trial

A randomized control trial investigates the impact of music training on phonological awareness and reading skills in children with developmental dyslexia. The data supports the article by Flaugnacco, Lopez, Terribili, Montico, Zoia, and Schön. It is available via the Papers with Code platform under an Open Access license.

Tabular, Audio, Music Training, Reading Skills, Randomized Control Trial, Developmental Dyslexia, Phonological Awareness, +1 more

Neural and Physiological Data from Participants Listening to Affective Music

The Scientific Data Curation Team provides metadata for a dataset of neural and physiological recordings. The data descriptor likely contains human-readable and machine-readable metadata files. The dataset's scale, such as participant count or recording duration, is not given in the metadata record.

Audio, Multimodal, Affective Music, Human Subjects, Neuroscience, Physiological Data, +1 more

ERP Data on Musical Emotion Priming for Concrete and Abstract Words

A figshare-hosted dataset from a study by Lili Ming, last updated in May 2026. It contains event-related potential (ERP) data from two experiments investigating how emotional music primes affect the processing of concrete and abstract words. The dataset is small, at 26.7 KB, and is stored in an XLSX file.

Tabular, Audio, Excel, Psycholinguistics, ERP, Language Processing, Music Emotion, Cognitive neuroscience, +1 more

Tamil Podcast Audio Dataset with 3,315 Hours of Processed Speech

A 3,315-hour collection of processed Tamil podcast audio recordings, part of a larger multilingual corpus of 57,568 hours across 12 languages. Created by InfoBayAI and last updated in June 2026, the dataset is designed to support the development of speech and conversational AI systems. It captures real-world interactions across diverse topics and formats.

Audio, Multilingual Speech, Large Scale, Tamil Language, Podcast Audio, Speech Recognition, +1 more

J-Pop Music fMRI Pilot Study: Behavioral and Brain Activity Data

Five participants completed a visual memory task under three auditory conditions during fMRI. Behavioral recall accuracy, subjective focus ratings, and region-of-interest brain activity in the anterior insula and temporo-occipital cortex were descriptively examined. The exploratory pilot study, authored by Yoshiko Tojo, suggests a paradigm for investigating affective music and memory.

Tabular, Audio, Memory Retrieval, Benchmark, J Pop, Fmri, Behavioral Study, Music Cognition, +1 more

High Time Resolution New Particle Formation Rates in Pittsburgh, 2023

This 125.8 MB package of code and data reproduces the figures for a 2023 atmospheric study in Pittsburgh. Darren Cheng published the dataset on figshare under a CC-BY-4.0 license. The data includes modeled and measured new particle formation rates at high time resolution.

Tabular, Time Series, CSV, Text, Atmospheric Science, Air Quality, Pittsburgh, Particle Formation, +1 more

Median Cortisol Levels Before and After Participatory Live Music Practice

This small dataset (5.5 KB, XLS format) contains median levels of CS and CF before and after a participatory live music practice. It was created by Nina M. van den Berg, last updated on June 1, 2026, and is shared on figshare under a CC-BY-4.0 license.

Tabular, Audio, Excel, Music Therapy, Psychology, Biomarkers, Cortisol, Participatory Music, +1 more

SinTechSVS: Phoneme-Level Singing Technique Annotations

A phoneme-level manual annotation dataset for singing techniques, covering pitch and timbral dimensions. The dataset was produced as part of the SinTechSVS project by the NUS Sound and Music Computing Lab. It is associated with a paper published in IEEE/ACM TASLP 2024.

Tabular, Audio, Music Information Retrieval, Singing Technique, Phoneme Annotation, Voice Synthesis, +1 more

Punjabi Podcast Audio Dataset with 4,840 Hours of Processed Speech

4,840 hours of processed Punjabi podcast audio form part of a larger 57,568-hour multilingual collection. The dataset captures real-world interactions across diverse topics and formats and is designed to support speech and conversational AI systems. It was created by InfoBayAI and last updated on Hugging Face in June 2026.

Audio, Punjabi Language, Multilingual Speech, Large Scale, Podcast Audio, Speech Recognition, +1 more

Arabic Podcast Audio Dataset with 6,024 Hours Across 12 Languages

A large-scale collection of 6,024 hours of processed Arabic podcast audio recordings, part of a larger multilingual corpus of 57,568 hours of processed podcast audio across 12 languages. It was created by InfoBayAI and last updated on 2026-06-08. The dataset captures real-world interactions across diverse topics and formats.

Audio, Arabic Speech, Multilingual Speech, Speech Ai, Large Scale, Podcast Audio, +1 more

ASR Leaderboard Longform: Three Benchmark Test Sets for Speech Recognition

ASR Leaderboard Longform provides three standardized benchmark test sets—Earnings-21, Earnings-22, and TED-LIUM—for evaluating longform automatic speech recognition models. The dataset is hosted by hf-audio on Hugging Face and was last updated on June 11, 2026. It is formatted as Parquet files for efficient loading via the Hugging Face datasets library.

Audio, Benchmark, Longform Audio, Speech Evaluation, Automatic Speech Recognition, +1 more

Gujarati Podcast Audio Collection with 2,471 Hours of Processed Speech

A large-scale collection of 2,471 hours of processed Gujarati podcast audio recordings, part of a broader multilingual corpus of 57,568 hours across 12 languages. The dataset was created by InfoBayAI and last updated in June 2026. It captures real-world interactions across diverse topics and formats to support speech AI development.

Audio, Multilingual Speech, Large Scale, Gujarati Language, Podcast Audio, Speech Recognition, +1 more

Bengali TTS Folderized Parquet Stage 1: Audio and Transcription Dataset

A cleaned and organized Parquet version of the bengali-tts-combined dataset, structured by speaker folders. The dataset includes audio chunks, their durations, and corresponding Bengali transcriptions, sourced from multiple speakers and videos. It was created by author 'smam' and last updated on 2026-06-15.

Tabular, Audio, Text To Speech, Bengali, Speech Synthesis, Audio Corpus, +1 more

St. Kitts and Nevis Urban Development Indicators from World Bank

World Bank Group data on urban development for St. Kitts and Nevis. The dataset likely contains indicators on urbanization, traffic, congestion, and air pollution, sourced from the United Nations Population Division, World Health Organization, and other international bodies. It was last updated on 2026-04-28 and is available under a CC-BY-4.0 license.

Tabular, CSV, Urban Development, World Bank, Healthcare, Finance, Environmental Strain, +1 more

St. Kitts and Nevis Infrastructure Indicators from World Bank Sources

World Bank Group data compiled from international sources like the International Road Federation and the International Telecommunications Union. This dataset likely contains indicators on water, sanitation, energy, housing, and transport infrastructure for the country of St. Kitts and Nevis. The data was last updated on 2026-04-28 and is shared under a CC-BY-4.0 license.

Tabular, CSV, Economic Development, World Bank, Healthcare, Finance, Infrastructure, Transport Energy, Poverty Reduction, +1 more

St. Kitts and Nevis Health Indicators from World Bank and UN Agencies

World Bank Group data on health systems, disease prevention, and population dynamics for St. Kitts and Nevis. The dataset covers topics including immunization, sanitation, safe drinking water, reproductive health, and nutrition. Data are aggregated from sources like the United Nations Population Division, WHO, UNICEF, and UNAIDS.

Tabular, CSV, Health Indicators, Population Dynamics, Healthcare, Disease Prevention, Finance, St Kitts Nevis, Public Health, +1 more

St. Kitts and Nevis Environmental Indicators from the World Bank

World Bank data on natural and man-made environmental resources for St. Kitts and Nevis. The dataset likely contains indicators covering forests, biodiversity, emissions, and pollution, sourced from the World Bank's data portal. It was last updated on 2026-04-28.

Tabular, CSV, Forests, Pollution, Finance, Emissions, Environment, Biodiversity, +1 more
