Kaggle hosts the Irodori-TTS Training Data. The dataset likely contains audio recordings and corresponding text transcripts for training text-to-speech models. Its creator, size, and specific collection date are unknown.
Use Cases
- Training a neural TTS model on paired audio-text data (inferred from domain, verify after download)
- Benchmarking speech synthesis quality on a specific corpus (inferred from domain, verify after download)
- Fine-tuning a pre-trained TTS model for a particular voice or style (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a major platform for data science resources.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, file formats, and column definitions are unknown, which may limit suitability assessment.