DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Speech & Audio Datasets | DataSalon

All Categories

🎤

Speech & Audio

Speech recognition, text-to-speech, speaker identification, music classification, audio event detection

2,602 datasets

Speech & Audio

Coastal Current Measurements from New England Fixed Platforms

Seventeen years of ocean current data were collected by the U.S. Geological Survey using current meters deployed in New England coastal waters. The collection includes U and V velocities in cm/s, rotor speeds, current directions in degrees, and water temperatures. Data collection spanned from May 1975 to February 1992.

Time SeriesFixed PlatformNew England WatersCoastal HydrodynamicsOcean Currents+1

0 views

Speech & Audio

Chinese Music Performance Evaluation Dataset with Expert Scores

Expert scores and audio features for music assessment provide a structured evaluation of musical performances. The dataset likely contains quantitative metrics derived from audio recordings alongside subjective ratings from experts. Its origin and scale are unspecified.

TabularAudioMusic EvaluationAudio FeaturesBenchmarkMusic PerformanceExpert Assessment+1

0 views

Speech & Audio

Shona Speech Dataset for ASR and TTS Workflows

A cleaned, metadata-rich Shona speech dataset prepared through a reproducible data engineering pipeline. The dataset is derived from the google/WaxalNLP source, specifically the sna_asr subset, and was last updated on March 20, 2026. It is intended as a general-purpose standard corpus for downstream tasks.

AudioOPTIMIZED-PARQUETParquetSize Categories10 Kn100 KText To SpeechTask Categoriestext To SpeechLibrarypolarsLibrarydaskModalitytextAfrican LanguageLibrarymlcroissantLibrarydatasetsLicensecc By 40RegionusNatural Language ProcessingTask Categoriesautomatic Speech RecognitionAudio CorpusShona SpeechSpeech RecognitionAutomatic Speech RecognitionShona+1

0 views

Speech & Audio

Multimodal Music Genre Dataset: GTZAN with Lyrics

A multimodal dataset combining audio and text data for music genre classification. It likely contains audio features from the GTZAN benchmark dataset paired with corresponding song lyrics. The dataset is published on Kaggle, but its specific creation date and author are unknown.

AudioMultimodalLyricsMusic Genre+1

0 views

Speech & Audio

Dataset_train_xtts: Text-to-Speech Training Data

Dataset_train_xtts is a dataset for training text-to-speech models, published on Kaggle. The dataset's specific content, size, and origin are not detailed in the provided metadata. Further details about the data's collection method, author, and temporal coverage are unavailable.

AudioText To SpeechSpeech SynthesisAudio Training+1

0 views

Speech & Audio

English Speech Interview Audio for AI Training, 10,000+ Hours

10,000+ hours of interview audio and video sourced for AI training. The data is described as ethically sourced. The dataset is hosted on Kaggle, but details about the author, organization, and specific collection dates are unknown.

AudioMultimodalAi TrainingInterview+1

0 views

Speech & Audio

Multilingual LibriSpeech Spanish Phoneme: 100K+ Audio-Phonetic Pairs

Between 100,000 and 1,000,000 Spanish audio segments and transcriptions derived from LibriVox audiobooks. Created by Cnam-LMSSC and updated in March 2026, it extends the Multilingual LibriSpeech (MLS) corpus with machine-generated phonetic transcriptions.

AudioParquetTask Categoriestext To SpeechLibrarypolarsLanguage CreatorscrowdsourcedLibrarydaskModalityaudioModalitytextSize Categories100 Kn1 MLibrarymlcroissantLibrarydatasetsLicensecc By 40PhonemesSource Datasetsfacebookmultilingual LibrispeechRegionusTask Categoriesautomatic Speech RecognitionAnnotations Creatorsmachine GeneratedSpeechLanguageesMultilingualitymonolingual+1

0 views

Speech & Audio

data_tts_32: Text-to-Speech Audio Samples

A Kaggle dataset titled 'data_tts_32'. The dataset likely contains audio files and associated text for text-to-speech synthesis tasks. Its specific content, size, and origin are not detailed in the provided metadata.

AudioText To SpeechSpeech SynthesisAudio Generation+1

0 views

Speech & Audio

Invoice Dataset for Business Process Analysis

An invoice dataset published on Kaggle. The dataset likely contains records related to business transactions and billing. Specific details regarding its size, origin, and time period are not provided in the available metadata.

TabularAccountingBusiness FinanceInvoice+1

0 views

Speech & Audio

UK Live Music Booking Rates by City, Band Size, and Event Type

UK Live Music Booking Rates 2026 (May) contains 3,847 booking rates for live music in the UK. The data is organized by city, band size, and event type. The dataset was sourced from Kaggle, but the author, organization, and specific collection method are unknown.

TabularAudio🇬🇧 United KingdomLive MusicBooking Rates+1

0 views

Speech & Audio

Vietnamese Speech Recognition Corpus With 100+ Hours

32,267 audio samples totaling 103.18 hours of Vietnamese speech, curated for automatic speech recognition. The dataset, created by thanhnew2001, was last updated in February 2026. It is structured into 29,041 training and 3,226 development samples.

AudioMachine LearningVietnamese LanguageSpeech Recognition+1

0 views

Speech & Audio

XTTSv2 Audios: Text-to-Speech Audio Samples

XTTSv2Audios is a dataset of audio files likely associated with the XTTSv2 text-to-speech model. The dataset is hosted on Kaggle, but its specific contents, size, and creation details are not provided in the available metadata. Further details such as the number of samples, speaker diversity, and recording conditions require verification after download.

AudioText To SpeechSpeech SynthesisAudio Samples+1

0 views

Speech & Audio

Tts Male 70H: Male Voice Text-to-Speech Audio Samples

Tts Male 70H is a text-to-speech dataset published on HuggingFace by user vfdanil. The title suggests it contains audio samples of a male voice, likely for speech synthesis tasks. The dataset was last updated on April 22, 2026.

AudioText To SpeechSpeech SynthesisMale VoiceAudio Generation+1

0 views

Speech & Audio

Noise Reduction Audio Data

Kaggle hosts a dataset titled 'Noise reduction'. The dataset's content, size, and specific source are not detailed in the provided metadata. Its last update date and licensing information are also unknown.

AudioSignal ProcessingAudio ProcessingNoise Reduction+1

0 views

Speech & Audio

Numberblocks One Voice Dataset

Ayf3 published the Numberblocks One Voice Dataset on Hugging Face in April 2026. The dataset likely contains audio recordings related to the Numberblocks children's media franchise. Its specific content, size, and structure require verification after download.

AudioChildrens Media+1

0 views

Speech & Audio

Theodore Roosevelt Psychological Portrait and Political Analysis

Sarah Lyons Watts's book presents a psychological portrait of the 26th U.S. president, Theodore Roosevelt. The work analyzes his personal obsession with masculinity and its influence on national politics, as noted by contemporary figures like Woodrow Wilson. It is a textual analysis of Roosevelt's legacy, sourced from the paperswithcode platform.

TextHistoryNew DealWhite MutationPsychologyTheodore RooseveltArt HistoryLawIconPortraitPolitics Of The United StatesSociologyPolitical SciencePassionBiographyPolitics+1

0 views

Speech & Audio

Massachusetts Coastal Dive Sites from Recreational and Archaeological Sources

Coastal Massachusetts dive sites include reefs, wrecks, jetties, and breakwaters popular for SCUBA diving. Data points were compiled by the Massachusetts Office of Coastal Zone Management from the Board of Underwater Archaeological Resources and dive club listings. The Massachusetts Office of Coastal Zone Management updated this layer on July 2, 2007.

GeospatialGeospatial PointsRecreational DivingUnderwater SitesCoastal Massachusetts+1

0 views

Speech & Audio

Historical USACE Dredging Projects in Massachusetts Waters

Point locations document federal dredge projects by the US Army Corps of Engineers along the Massachusetts coastline. The data is historical, with records up to 16 December 1998. The dataset was compiled by the organization SCIOPS.

GeospatialDredging ProjectsHistorical InfrastructureMassachusetts CoastUsaceCoastal Engineering+1

0 views

Speech & Audio

Historical USACE Dredge Project Polygons for Massachusetts Waters

Polygonal extents document federal dredging projects by the US Army Corps of Engineers along the Massachusetts marine coastline. The dataset includes navigational channels, anchorages, harbors, beaches, and dikes, with records historical to December 16, 1998. It was compiled by the organization SCIOPS.

GeospatialUsace ProjectsHistorical GeospatialDredging ProjectsCoastal managementMarine Infrastructure+1

0 views

Speech & Audio

Massachusetts Boating Activity Arcs from 2005 Workshops

Geospatial arcs represent commercial, charter, and recreational boating uses within the Massachusetts Coastal Zone. The data delineates three distinct activity subtypes. It was compiled by SCIOPS from expert workshops held in Boston and Waquoit in June 2005.

GeospatialCoastal managementMaritime TransportationRecreationGeospatial Analysis+1

0 views

PreviousPage 53 of 130Next