Librispeech Synth 300h max 20spks is an audio dataset published on Kaggle. The title suggests it contains up to 300 hours of synthetic speech audio, likely generated from the LibriSpeech corpus, featuring a maximum of 20 distinct speakers. Its specific creation method and exact content require verification after download.
Use Cases
- Training text-to-speech synthesis models (inferred from domain, verify after download)
- Benchmarking speech recognition systems on synthetic audio (inferred from domain, verify after download)
- Studying speaker characteristics in synthesized speech (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a major data science platform.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, file formats, and column-level documentation are unknown.
- Data may reflect bias inherent to the source corpus used for synthesis.