LibriSpeech ASR: 1,000 Hours of Aligned English Audiobook Speech

Name: LibriSpeech ASR: 1,000 Hours of Aligned English Audiobook Speech
Creator: openslr
Published: 2022-03-02T23:29:22
Keywords: Source Datasetsoriginal, Language Creatorsexpert Generated, Librarypolars, Language Creatorscrowdsourced, Librarydask, Languageen, Modalitytext, Size Categories100 Kn1 M, Task Idsspeaker Identification, Librarymlcroissant, Task Categoriesaudio Classification, Librarydatasets, Licensecc By 40, Parquet, Regionus, Task Categoriesautomatic Speech Recognition, Multilingualitymonolingual, Annotations Creatorsexpert Generated

by openslrUpdated 10mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

LibriSpeech contains 1,000 hours of 16kHz read English speech derived from LibriVox audiobooks, prepared by Vassil Panayotov and Daniel Povey. The corpus features segmented and aligned audio paired with corresponding text transcripts for speech recognition and speaker identification tasks. The dataset is organized into subsets based on the difficulty of the speech recognition task and the quality of the recordings.

Use Cases

Training automatic speech recognition (ASR) models using the 16kHz audio and text transcript pairs
Speaker identification tasks using the provided speaker labels
Audio classification based on the read audiobook segments

Strengths

1,000 hours of 16kHz audio recordings
Expert-generated alignments between speech and text
Large scale with 100,000 to 1,000,000 records

Limitations

Domain bias toward read audiobook speech rather than natural conversation
Monolingual English coverage only

Provenance

Source: LibriVox project via OpenSLR
Collection Method: Audiobooks were segmented and aligned with text by researchers.
Freshness: Last updated July 2025.

Licensed under CC BY 4.0; data is derived from the LibriVox project and is a standard benchmark in the ASR community.

Parquet Source Datasetsoriginal Language Creatorsexpert Generated Librarypolars Language Creatorscrowdsourced Librarydask Languageen Modalitytext Size Categories100 Kn1 M Task Idsspeaker Identification Librarymlcroissant Task Categoriesaudio Classification Librarydatasets Licensecc By 40 Regionus Task Categoriesautomatic Speech Recognition Multilingualitymonolingual Annotations Creatorsexpert Generated

Related Datasets

Quality Score

C44

Description

51

Source

36

Reputation

61

Access

22

Community

76.5K downloads

220 likes

0 views

Dataset Info

Author: openslr
Created: Mar 2, 2022
Updated: Jul 25, 2025
Last synced: Jun 8, 2026

Access

22

Community

76.5K downloads

220 likes

0 views

Dataset Info

Author: openslr
Created: Mar 2, 2022
Updated: Jul 25, 2025
Last synced: Jun 8, 2026

LibriSpeech ASR: 1,000 Hours of Aligned English Audiobook Speech

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info