HiFiTTS-2 is a large-scale speech dataset from NVIDIA. It contains metadata for approximately 36,700 hours of audio derived from LibriVox audiobooks, covering approximately 5,000 speakers, with audio at a 48 kHz sampling rate. The metadata includes estimated bandwidth. The dataset was last updated on the platform in November 2025.
The license is unknown, so users must verify the licensing terms before use. Audio files are not included in the dataset and must be downloaded separately from LibriVox.