This collection gathers multiple speech corpora, grouped into Automatic Speech Recognition (ASR) and Text-to-Speech (TTS) categories. The corpora provide audio recordings and text transcriptions for speech-related machine learning tasks.
Use Cases
- Train speech recognition models using the ASR-labeled audio and transcription data
- Develop voice synthesis systems using the TTS-labeled speech samples
- Benchmark speech processing algorithms across the provided corpus categories
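As a rough illustration of the first two use cases, the sketch below pairs each audio recording with its transcription. The on-disk layout is an assumption, not something this description specifies: it supposes that every audio file has a sibling `.txt` transcript with the same file stem. The function name `pair_audio_with_transcripts` and the `.wav` default are likewise hypothetical.

```python
from pathlib import Path
from typing import List, Tuple


def pair_audio_with_transcripts(
    root: str, audio_ext: str = ".wav"
) -> List[Tuple[Path, str]]:
    """Return (audio_path, transcript_text) pairs found under root.

    Assumption (hypothetical layout): each audio file has a sibling
    .txt file with the same stem. Audio files without one are skipped.
    """
    pairs = []
    # Walk the tree in sorted order so the result is deterministic.
    for audio_path in sorted(Path(root).rglob(f"*{audio_ext}")):
        transcript_path = audio_path.with_suffix(".txt")
        if not transcript_path.is_file():
            continue  # no transcript for this recording
        text = transcript_path.read_text(encoding="utf-8").strip()
        pairs.append((audio_path, text))
    return pairs
```

The same pairs could feed either direction: audio as input and text as target for ASR, or text as input and audio as target for TTS.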
Strengths
- Includes datasets labeled for Automatic Speech Recognition (ASR)
- Includes datasets labeled for Text-to-Speech (TTS)
- Aggregates multiple speech corpora into a single collection