This dataset provides pre-extracted audio codec tokens for TTS training: 6,082 samples totaling 15.6 hours of audio. It was created by somu9 and last updated on 2026-05-18. The tokens were produced with the MOSS-Audio-Tokenizer-Nano codec at a sample rate of 48,000 Hz and a frame rate of 12.5 Hz.
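The stated figures imply a few derived quantities that help when sizing batches or sequence lengths. The sketch below is plain arithmetic on the numbers above; the variable names are illustrative, and because the 15.6-hour total is a rounded value, the results are approximate. It says nothing about the per-frame token layout (for example, the number of codebooks), which is not described here.

```python
# Figures taken from the dataset description above.
SAMPLE_RATE_HZ = 48_000   # audio sample rate
FRAME_RATE_HZ = 12.5      # codec frames per second
NUM_SAMPLES = 6_082       # number of dataset entries
TOTAL_HOURS = 15.6        # rounded total duration

# Audio samples covered by one codec frame.
samples_per_frame = SAMPLE_RATE_HZ / FRAME_RATE_HZ

# Approximate totals derived from the rounded duration.
total_seconds = TOTAL_HOURS * 3600
total_frames = total_seconds * FRAME_RATE_HZ

# Approximate per-entry averages.
avg_seconds = total_seconds / NUM_SAMPLES
avg_frames = total_frames / NUM_SAMPLES

print(f"audio samples per frame: {samples_per_frame:.0f}")
print(f"total frames (approx.):  {total_frames:,.0f}")
print(f"avg clip length:         {avg_seconds:.2f} s")
print(f"avg frames per clip:     {avg_frames:.1f}")
```

Under these assumptions, each frame spans 3,840 audio samples and the average clip is roughly nine seconds long, which works out to a little over a hundred frames per entry.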
The license is not specified, which is a critical restriction: without a stated license, the terms of use are undefined. The full data format description is truncated here; see the Hugging Face dataset page for the complete version.