A growing collection of captioned anime voice recordings, organized into language and speaker splits. The audio is curated from anime media for speech synthesis and speech recognition applications, and the repository is maintained as a dynamic collection that continues to expand.
Use Cases
- Train text-to-speech (TTS) models using the audio recordings and their associated captions
- Develop multi-speaker synthesis systems using the speaker-specific data splits
- Build language classification models using the language-separated audio directories
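As a rough illustration of how the language and speaker splits might be consumed, the sketch below walks a local copy of the data and pairs each recording with its caption. The layout it assumes (`<root>/<language>/<speaker>/<clip>.wav` with a sibling `<clip>.txt` caption file) is hypothetical and not confirmed by the dataset description, as are the names `Clip` and `iter_clips`; adjust the glob pattern and caption lookup to match the actual repository structure.

```python
from pathlib import Path
from typing import Iterator, NamedTuple


class Clip(NamedTuple):
    """One captioned recording (hypothetical record type for illustration)."""
    language: str
    speaker: str
    audio_path: Path
    caption: str


def iter_clips(root: Path, audio_ext: str = ".wav") -> Iterator[Clip]:
    """Yield clips from an ASSUMED <root>/<language>/<speaker>/ layout.

    Each audio file is assumed to have a caption stored next to it as a
    UTF-8 text file with the same stem and a .txt suffix.
    """
    for audio_path in sorted(root.glob(f"*/*/*{audio_ext}")):
        caption_path = audio_path.with_suffix(".txt")
        if not caption_path.is_file():
            continue  # skip recordings that have no caption file
        speaker_dir = audio_path.parent
        yield Clip(
            language=speaker_dir.parent.name,  # top-level directory name
            speaker=speaker_dir.name,          # second-level directory name
            audio_path=audio_path,
            caption=caption_path.read_text(encoding="utf-8").strip(),
        )
```

Under these assumptions, the `caption` and `audio_path` fields would feed a TTS pipeline, `speaker` would serve as the conditioning label for multi-speaker synthesis, and `language` would serve as the target label for language classification.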
Strengths
- Pairs each anime character voice recording with a caption
- Partitions the data into language-specific subsets
- Includes speaker splits to facilitate multi-speaker model training
- Maintained as a dynamic, expanding collection by ShoukanLabs