DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Speech & Audio Datasets | DataSalon

All Categories

🎤

Speech & Audio

Speech recognition, text-to-speech, speaker identification, music classification, audio event detection

2,587 datasets

Speech & Audio

Yoruba BibleTTS Aligned: Text and Audio for Speech Synthesis

A Kaggle dataset titled 'Yoruba BibleTTS Aligned - 1TH'. The title suggests it contains Yoruba language text and corresponding audio, likely aligned for text-to-speech model training. The dataset's author, size, and specific contents are unknown from the provided metadata.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 2PE: Yoruba Bible Text-to-Speech Data

Yoruba BibleTTS Aligned - 2PE is a dataset hosted on Kaggle. The title suggests it contains audio recordings and corresponding text from the Yoruba Bible, likely aligned for text-to-speech model training. The dataset's author, organization, size, and other metadata are unknown.

TextAudioText To SpeechYorubaSpeech SynthesisAudio AlignmentBible+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Text and Audio for Speech Synthesis

Yoruba BibleTTS Aligned is a dataset for text-to-speech research, likely containing aligned audio recordings and corresponding text transcripts. It was published on Kaggle, but the author, organization, and specific data volume are unknown. The dataset's last update date is also unspecified.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Text and Audio for Speech Synthesis

Yoruba BibleTTS Aligned is a dataset hosted on Kaggle. The title suggests it contains aligned text and audio data, likely for training text-to-speech models for the Yoruba language. The specific version '2TH' may indicate a particular subset or alignment method, but detailed metadata is unavailable.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 2TI: Aligned Audio and Text for Speech Synthesis

Yoruba BibleTTS Aligned - 2TI is a dataset published on Kaggle. The title suggests it contains audio recordings and corresponding text for the biblical book of 2 Timothy in the Yoruba language, likely aligned for text-to-speech model training. Metadata is minimal; the specific number of audio clips, recording quality, and alignment method are unknown and require verification after download.

TextAudioText To SpeechAligned AudioSpeech SynthesisYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 1TI: Aligned Audio and Text for Speech Synthesis

Yoruba BibleTTS Aligned - 1TI is a dataset published on Kaggle. The title suggests it contains aligned audio and text data, likely for the First Epistle to Timothy (1TI) from the Bible in the Yoruba language. The dataset's specific size, format, and creation details are not provided in the available metadata.

TextAudioText To SpeechAligned AudioSpeech SynthesisYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - HEB: Speech Synthesis Corpus

Yoruba-language speech data likely aligned with biblical text, sourced from the HEB (Holy Bible) corpus. The dataset is hosted on Kaggle, but its creator, size, and specific contents are not detailed. Columns and sample data are unavailable for review.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - JAS: Speech Synthesis Dataset

Yoruba language audio and text data, likely aligned for text-to-speech model training. The dataset title suggests it contains Bible passages processed for the JAS project. It is hosted on Kaggle, but specific details on volume, creation date, and author are unavailable.

TextAudioText To SpeechYorubaSpeech SynthesisAligned DataBible+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: First Epistle of Peter

Yoruba BibleTTS Aligned - 1PE is a dataset published on Kaggle. The title suggests it contains audio and aligned text for speech synthesis, likely derived from the First Epistle of Peter in the Yoruba Bible. The dataset's specific size, author, and creation details are not provided in the available metadata.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: 1 John Chapter

Yoruba BibleTTS Aligned - 1JN is a dataset for text-to-speech research, likely containing aligned audio and text for the biblical book of 1 John in the Yoruba language. The dataset is hosted on Kaggle, but its specific size, creation details, and structure are not provided in the available metadata. Its primary purpose appears to be supporting the development of speech synthesis models for Yoruba.

TextAudioText To SpeechAligned AudioSpeech SynthesisYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 2JN: Yoruba Bible Speech Synthesis Data

Yoruba BibleTTS Aligned - 2JN is a dataset for speech synthesis, likely containing aligned audio and text for the Yoruba language. The dataset title suggests it is sourced from the biblical book of 2 John. It is hosted on Kaggle, but detailed metadata about its size, structure, and creation is unavailable.

TextAudioText To SpeechBible CorpusSpeech SynthesisYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 3JN: Aligned Audio and Text for Speech Synthesis

Aligned audio and text data for the Yoruba language, likely derived from the biblical book of 3 John (3JN). The dataset is published on Kaggle, but its specific size, creation date, and author are unknown. Columns suggest it contains paired audio recordings and corresponding text transcripts.

TextAudioSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - JUD: Yoruba Bible Speech Synthesis Dataset

Yoruba-language audio data, likely aligned with biblical text for speech synthesis. The dataset is hosted on Kaggle, but its specific size, creation date, and author are unknown. Columns suggest it contains audio files and corresponding text transcripts.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Speech Synthesis Corpus

Yoruba BibleTTS Aligned - REV is a dataset published on Kaggle. The title suggests it contains text and audio data aligned for text-to-speech synthesis, likely derived from the Yoruba Bible. Specific details on size, author, and creation date are unavailable from the provided input.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Accepted (Consolidated): Yoruba Speech Synthesis Dataset

A Yoruba language dataset for text-to-speech (TTS) applications, likely containing audio recordings and corresponding text transcripts. The dataset is titled 'BibleTTS Accepted (Consolidated)', suggesting it may be a curated collection of speech data. It is hosted on Kaggle, but detailed metadata about its size, structure, and origin is unavailable.

TextAudioText To SpeechAudio DataSpeech SynthesisYoruba Language+1

0 views

Speech & Audio

MMS TTS Yoruba: Text-to-Speech Data for Yoruba Language

A text-to-speech dataset for the Yoruba language, published on Kaggle. It is part of the MMS (Massively Multilingual Speech) project by Meta (formerly Facebook). The dataset likely contains audio samples paired with corresponding text transcripts.

AudioText To SpeechMultilingual AiYorubaSpeech Synthesis+1

0 views

Speech & Audio

Legco Speech: Hong Kong Legislative Council Audio with 20,471 Hours of Segmented Speech

22,196 hours of raw audio from Hong Kong Legislative Council meetings, processed into 20,471 hours of segmented speech. The dataset, created by laubonghaudoi, is split into raw and segmented subsets. It was last updated on 2026-02-26.

AudioCantoneseParliamentary ProceedingsAudio ProcessingSpeech Recognition+1

0 views

Speech & Audio

Swivuriso: Over 3000 Hours of Speech Across 7 South African Languages

Swivuriso is a large-scale multilingual speech dataset targeting over 3000 hours of audio across 7 South African languages. The dataset is developed by dsfsi-anv to support Automatic Speech Recognition and inclusive speech technologies for low-resource African languages. It was last updated on the platform in February 2026.

AudioMultilingualParquetCommunity CenteredLibrarypolarsAudio DataLibrarydaskModalitytextSize Categories100 Kn1 MLibrarymlcroissantLibrarydatasetsLicensecc By 40LanguagexhoLanguagezulLanguagevenSouth AfricaArxiv251202201RegionusLarge ScaleTask Categoriesautomatic Speech RecognitionLanguagetsoLanguagetsnLanguagesotMultilingual AudioSouth African LanguagesSpeech RecognitionLow Resource LanguagesLanguagende+1

0 views

Speech & Audio

Contrast Stretching for Audio Signal Processing

Contrast stretching is a common technique for enhancing audio signal dynamics. This dataset, hosted on Kaggle, likely contains audio samples or features for applying or studying this method. The specific content, size, and origin require verification after download due to minimal metadata.

AudioContrastive LearningAudio Processing+1

0 views

Speech & Audio

NOAA Tide Gauge Station Locations for the Massachusetts Coastline

A September 22, 2006 snapshot of tide gauge station locations along the Massachusetts coastline, sourced from the NOAA Tides and Currents website. The Massachusetts Office of Coastal Zone Management compiled the data, which measures the diurnal tide cycle of two high and low tides per day. The dataset shows where NOAA has placed instruments to monitor sea level changes.

GeospatialOceanographyNoaaTide GaugesCoastal Monitoring+1

0 views

PreviousPage 45 of 130Next