Librispeech Synth 300h is a synthetic speech audio dataset derived from the LibriSpeech corpus. The title suggests it contains up to 300 hours of generated audio, likely from a maximum of 10 distinct speaker profiles. It is hosted on the Kaggle platform, but detailed metadata about its creation and contents is not provided.
Use Cases
- Training or fine-tuning text-to-speech models (inferred from domain, verify after download)
- Benchmarking speech recognition systems on synthetic audio (inferred from domain, verify after download)
- Studying speaker characteristics in generated speech (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a major platform for data science resources.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Data may reflect bias inherent to its source corpus and synthesis method.
Provenance
- Source
- Derived from the LibriSpeech corpus.
- Collection Method
- Likely generated via speech synthesis techniques.