Available on 1 platform
Filtered GOL Dataset is a Japanese text-to-speech (TTS) resource containing approximately 1.2 million audio samples totaling 1,880 hours from 380 speakers. It was filtered by tts-dataset for TTS training, using rules on text length, audio duration, and per-speaker minimums. The audio is in FLAC format at 44.1 kHz.
The dataset is approximately 280 GB and is packaged in the WebDataset (.tar) format, so loading it requires a library that can read tar-based shards, such as the webdataset Python package. License information is not provided.
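Because the WebDataset format is plain tar underneath, a shard can be inspected with the Python standard library alone. The sketch below follows the general WebDataset convention that consecutive tar members sharing a base name (everything before the first dot of the file name) belong to one sample. The shard name and the `flac`/`json` field names are assumptions for illustration; the listing does not state the actual file layout of this dataset.

```python
import tarfile
from typing import Iterator, Tuple


def split_key(member_name: str) -> Tuple[str, str]:
    """Split 'dir/0001.flac' into ('dir/0001', 'flac') at the first dot of the file name."""
    directory, _, filename = member_name.rpartition("/")
    stem, _, extension = filename.partition(".")
    key = f"{directory}/{stem}" if directory else stem
    return key, extension


def iter_samples(tar_path: str) -> Iterator[dict]:
    """Yield one dict per sample, grouping consecutive tar members that share a key.

    Each dict holds '__key__' plus one raw-bytes entry per file extension.
    """
    current: dict = {}
    with tarfile.open(tar_path, "r") as archive:
        for member in archive:
            if not member.isfile():
                continue  # skip directories and other non-file entries
            key, extension = split_key(member.name)
            if current and current["__key__"] != key:
                yield current  # key changed, so the previous sample is complete
                current = {}
            current["__key__"] = key
            current[extension] = archive.extractfile(member).read()
    if current:
        yield current  # emit the final sample


if __name__ == "__main__":
    # Hypothetical shard name; substitute a real shard from the download.
    for sample in iter_samples("shard-000000.tar"):
        fields = {name: len(value) for name, value in sample.items() if name != "__key__"}
        print(sample["__key__"], fields)
        break
```

This only returns raw bytes. Decoding the FLAC audio needs a separate audio library, and for actual training a streaming loader such as the webdataset package is likely a better fit than this inspection helper.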