Japanese speech audio recordings and transcriptions sampled at 16kHz from various Galgame (visual novel) titles. The dataset is released under the GNU General Public License v3.0 and strictly forbids commercial use of the data or any resulting models.
Use Cases
- Train an automatic speech recognition (ASR) model using the 16kHz audio samples
- Develop open-source Japanese voice models in compliance with the GNU GPL v3.0 license
- Analyze phonetic variations in Japanese speech across different Galgame titles
- Evaluate the accuracy of Japanese ASR systems on stylized character voices
Strengths
- Audio files are standardized to a 16kHz sampling rate
- Dataset is licensed under the GNU General Public License v3.0
- Prohibits commercial use of any models trained using the provided audio data
- Requires any model trained on the dataset to be released as open source