DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Speech & Audio Datasets | DataSalon

All Categories

🎤

Speech & Audio

Speech recognition, text-to-speech, speaker identification, music classification, audio event detection

2,587 datasets

Speech & Audio

Pashto Speech Corpus for Domain-Specific ASR Research

A domain-specific speech corpus for the Pashto language. It is intended for research in automatic speech recognition and general speech processing. The dataset's author, organization, and size are unknown.

AudioSpeech CorpusDomain SpecificNatural Language ProcessingPashto LanguageAutomatic Speech Recognition+1

0 views

Speech & Audio

Annotated Shona Speech Dataset with Acoustic Speaker Labels

An annotated, speaker-relabelled, and loudness-normalised Shona speech dataset prepared through a reproducible Modal-based data engineering pipeline. This release addresses speaker label contamination in the original source labels by replacing identity columns with acoustically-derived speaker assignments. The dataset is authored by manassehzw and was last updated in March 2026.

AudioOPTIMIZED-PARQUETParquetSize Categories10 Kn100 KText To SpeechTask Categoriestext To SpeechLibrarypolarsLibrarydaskModalitytextAfrican LanguageLibrarymlcroissantLibrarydatasetsLicensecc By 40RegionusTask Categoriesautomatic Speech RecognitionSpeech RecognitionShona+1

0 views

Speech & Audio

ConsolidadoSentenciasRutaIndividualURT

The dataset shows the number of court sentences issued and requests resolved per municipality in the individual restitution route of Colombia's Land Restitution Unit. It includes columns for hectares ordered for restitution, beneficiary counts, and municipality codes. The data is provided by www.datos.gov.co and was last updated on 2026-03-09.

TabularGeospatialCSVXMLJSONLand RestitutionColombiaSolicitudes En SentenciaJudicial DecisionsProperty rightsUrtMunicipal Data+1

0 views

Speech & Audio

Nepali Text-to-Speech Cleaned Audio Dataset

Nepali language audio data for text-to-speech applications, published on HuggingFace by author lilgoose777. The dataset was last updated on 2026-05-05. Its specific size, format, and content details are not provided in the metadata.

AudioText To SpeechAudio DataSpeech SynthesisNepali Language+1

0 views

Speech & Audio

Konkani ASR Dataset: 201 Audio Files for Speech Recognition

201 FLAC audio files specifically collected for training Automatic Speech Recognition models in the Konkani language. The dataset was uploaded to Hugging Face by alvynabranches and was last updated on March 21, -2026. All audio files are organized within a single directory.

AudioParquetLibrarypolarsAudio DataLibrarydaskLicensecc By Nc Nd 40LanguageknnSize Categoriesn1 KModalitytextLibrarymlcroissantLibrarydatasetsSpeech ProcessingRegionusTask Categoriesautomatic Speech RecognitionLanguagekokAutomatic Speech RecognitionLanguagegomKonkani Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Yoruba Speech Synthesis Data

Yoruba BibleTTS Aligned - DEU is a dataset published on Kaggle. The title and platform tags suggest it contains aligned audio and text data for Yoruba speech synthesis, likely sourced from Bible text. The dataset's specific size, structure, and creation details are unknown.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - GEN

Yoruba BibleTTS Aligned - GEN is a dataset published on Kaggle. It likely contains audio recordings and corresponding text transcripts aligned for speech synthesis tasks. The dataset's specific size, origin, and creation date are not provided in the available metadata.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Speech Synthesis Dataset

A dataset for text-to-speech synthesis in the Yoruba language, likely containing aligned audio and text. It is hosted on Kaggle, but the specific creation date and author are unknown. The dataset's size, exact contents, and creation methodology require verification after download.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Text-to-Speech Audio and Text Alignment

Yoruba BibleTTS Aligned is a dataset for speech synthesis, likely containing audio recordings aligned with corresponding text. The dataset is hosted on Kaggle, but detailed metadata such as the number of samples, file formats, and creation details are not provided. Its title suggests a focus on the Yoruba language and the Book of Exodus.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Text and Audio for Speech Synthesis

Yoruba BibleTTS Aligned - LEV is a dataset for text-to-speech research, published on Kaggle. The title suggests it contains aligned text and audio data, likely derived from the Yoruba translation of the Bible's Book of Leviticus. Specific details on the number of utterances, audio format, and creation methodology are not provided in the available metadata.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 2KI: Yoruba Bible Speech and Text Alignment

Yoruba BibleTTS Aligned - 2KI is a dataset published on Kaggle. The title suggests it contains aligned audio and text data for the biblical book of 2 Kings in the Yoruba language, likely intended for text-to-speech model training. The dataset's author, organization, and other metadata are unknown.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Speech and Text Corpus

Yoruba BibleTTS Aligned is a dataset from Kaggle. The title suggests it contains Yoruba language audio and corresponding text, likely aligned for speech synthesis tasks. The dataset's author, organization, and specific size are unknown.

TextAudioText To SpeechBible CorpusSpeech SynthesisAligned DataYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Yoruba Speech Synthesis Corpus

Yoruba BibleTTS Aligned - RUT likely contains audio recordings and corresponding text for speech synthesis. The dataset is hosted on Kaggle, but details on its size, creation, and contributors are not provided. Its title suggests it is specifically designed for text-to-speech tasks in the Yoruba language.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Speech Synthesis Dataset from JOS

An audio-text dataset for Yoruba speech synthesis, likely containing aligned Bible verses. The dataset is published on Kaggle, but its specific size, creation date, and author are unknown. Its content appears to be derived from the JOS (Joshua Project) Bible translation.

TextAudioText To SpeechAligned AudioSpeech SynthesisYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - JDG: Yoruba Speech Synthesis Dataset

Yoruba BibleTTS Aligned - JDG is a dataset for text-to-speech synthesis in the Yoruba language. The dataset likely contains audio recordings aligned with corresponding text transcripts, inferred from the title. It is published on Kaggle, but details on its size, creation method, and specific structure are unavailable.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: First Book of Samuel

Yoruba-language audio and text data for the First Book of Samuel (1SA), aligned for text-to-speech (TTS) applications. The dataset is published on Kaggle, but its creator, size, and specific alignment methodology are unknown. Its content likely consists of Yoruba Bible verses paired with corresponding speech recordings.

TextAudioText To SpeechSpeech SynthesisAligned DataYoruba LanguageBible Text+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Text and Audio for Speech Synthesis

A Yoruba language dataset for text-to-speech (TTS) development, likely containing aligned audio recordings and corresponding text transcripts. The dataset is hosted on Kaggle, but its specific size, creator, and update history are unknown. Columns suggest it contains audio files and aligned text, potentially sourced from biblical or other spoken material.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned: Yoruba Speech Synthesis Dataset

A dataset for Yoruba speech synthesis, likely containing audio recordings aligned with corresponding text passages. It is hosted on Kaggle, but the specific creation date, author, and dataset size are unknown. The content appears to be designed for training text-to-speech models for the Yoruba language.

TextAudioText To SpeechSpeech SynthesisAudio AlignmentYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 2SA: Aligned Audio and Text for Speech Synthesis

A dataset for text-to-speech synthesis, likely containing aligned audio recordings and corresponding text transcripts. The data appears to be sourced from the Yoruba Bible, as suggested by the title. It is hosted on Kaggle, but specific details on its size, creation date, and author are not provided.

TextAudioText To SpeechBible CorpusSpeech SynthesisYoruba Language+1

0 views

Speech & Audio

Yoruba BibleTTS Aligned - 1KI: Aligned Audio and Text for Speech Synthesis

The dataset title indicates it is an aligned resource for the book of 1 Kings (1KI) from the Yoruba Bible. It is hosted on Kaggle, a platform for data science projects. The specific content, such as the number of audio files or alignment precision, requires verification after download.

TextAudioText To SpeechAligned AudioSpeech SynthesisYoruba LanguageBible Text+1

0 views

PreviousPage 42 of 130Next