Librispeech Synth 300h is a synthetic speech dataset derived from the LibriSpeech corpus. It likely contains up to 300 hours of synthetic audio generated with at most 20 speaker voices. The dataset is hosted on Kaggle.
Use Cases
- Training or benchmarking text-to-speech models (inferred from domain, verify after download)
- Studying speaker characteristics in synthetic audio (inferred from domain, verify after download)
- Evaluating the naturalness of machine-generated speech (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform for sharing datasets.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
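Since the contents have to be checked after download, a first pass could simply tally what is on disk. The sketch below is a minimal example under stated assumptions, not a documented procedure for this dataset: it assumes the archive has already been extracted to a local directory and that at least some of the audio is PCM WAV (the actual file format is not confirmed). The function name `summarize_audio_dir` and the example path are hypothetical.

```python
import wave
from collections import Counter
from pathlib import Path


def summarize_audio_dir(root):
    """Tally file extensions and total WAV duration (in hours) under root.

    Assumption: audio, if stored as WAV, is PCM-encoded; files that the
    standard-library wave module cannot parse are reported, not counted.
    """
    root = Path(root)
    ext_counts = Counter()
    wav_seconds = 0.0
    unreadable = []
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        ext = path.suffix.lower()
        ext_counts[ext] += 1
        if ext == ".wav":
            try:
                with wave.open(str(path), "rb") as wf:
                    # duration = number of frames / frames per second
                    wav_seconds += wf.getnframes() / wf.getframerate()
            except (wave.Error, EOFError):
                unreadable.append(path)
    return {
        "extension_counts": dict(ext_counts),
        "wav_hours": wav_seconds / 3600.0,
        "unreadable_wavs": unreadable,
    }


# Example call (hypothetical path to the extracted download):
# summary = summarize_audio_dir("librispeech_synth_300h")
# print(summary["extension_counts"], round(summary["wav_hours"], 1))
```

Comparing `wav_hours` against the 300 hours suggested by the dataset name gives a quick sanity check. If the extension tally shows another format (for example FLAC or MP3), the duration step would need a different decoder, and any metadata files that turn up in the tally are the natural place to look for speaker and field information.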
Provenance
- Source
  - Derived from the LibriSpeech corpus.
- Collection Method
  - Synthetic generation, likely via text-to-speech systems.