Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
somu9's Hindi Tokens dataset contains 305,847 pre-extracted audio codec tokens for text-to-speech training. The data comprises 544.2 hours of Hindi audio, with an average sample duration of 6.4 seconds. It was last updated on June 2, 2026.
License is unknown; users should verify terms of use before downloading.