Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
585 hours of 24kHz English speech audio form this multi-speaker corpus derived from LibriVox audiobooks and Project Gutenberg texts. Heiga Zen and Google Speech/Brain team members prepared the dataset specifically for TTS research. The dataset card was last updated in February 2024.
License is listed as 'cc By 40' on the platform but specific terms should be verified on the original dataset page. The dataset is designed for TTS, not automatic speech recognition.