The LibriSpeech corpus contains approximately 1000 hours of read English speech, sampled at 16 kHz. The audio is derived from audiobooks in the LibriVox project, and the corpus was prepared by Vassil Panayotov with assistance from Daniel Povey.
Audio files are stored in FLAC format (.flac). To work with them as float32 arrays, decode them with a library such as soundfile.