Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Packed with approximately 1000 hours of read English speech audio, prepared by Vassil Panayotov with assistance from Daniel Povey. It is derived from LibriVox audiobooks, segmented and aligned, with a 16 kHz sampling rate.
Audio files are in FLAC format; conversion to float32 arrays is required for typical ML pipelines, as demonstrated in the provided Python code snippet.