Librispeech Test Clean

Name: Librispeech Test Clean
Creator: AudioLLMs
Published: 2024-07-15T04:15:54
Keywords: Size Categories: 1K<n<10K, Library: polars, Library: dask, Modality: audio, Modality: text, Library: mlcroissant, Library: datasets, Parquet, Region: us

by AudioLLMs, updated 1y ago

Available on 1 platform

Description

This set contains 2,620 high-quality audio clips with transcriptions, derived from public-domain audiobooks, for evaluating speech recognition systems. It is categorized as "clean" because its noise levels are lower and its recording quality is higher than in other LibriSpeech subsets.

Use Cases

Calculate Word Error Rate (WER) for speech-to-text models by comparing predictions to the text field
Test the zero-shot capabilities of Audio Large Language Models using the audio input and text ground truth
Perform speaker verification or identification tasks using the speaker_id labels
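
The WER use case above can be sketched in a few lines. This is a minimal illustration, not the scoring script of any particular benchmark: it assumes references and predictions are plain strings, and that lowercasing plus whitespace splitting is an acceptable normalization (a real evaluation may need stricter text normalization). The function names `edit_distance` and `corpus_wer` are hypothetical.

```python
from typing import List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def corpus_wer(references: List[str], predictions: List[str]) -> float:
    """Total word edits divided by total reference words."""
    if len(references) != len(predictions):
        raise ValueError("references and predictions must have equal length")
    total_edits = 0
    total_words = 0
    for ref_text, hyp_text in zip(references, predictions):
        # Assumed normalization: lowercase and split on whitespace.
        ref_words = ref_text.lower().split()
        hyp_words = hyp_text.lower().split()
        total_edits += edit_distance(ref_words, hyp_words)
        total_words += len(ref_words)
    if total_words == 0:
        raise ValueError("references contain no words")
    return total_edits / total_words
```

Here `references` would hold the values of the text field and `predictions` the model outputs, in the same order. Summing edits over the whole corpus before dividing weights long utterances more than short ones, which is the usual convention for reporting WER.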

Strengths

2,620 audio samples paired with normalized text transcriptions
Audio files provided as 16 kHz FLAC, a lossless format that preserves signal quality
Metadata includes speaker_id, chapter_id, and id for tracking source audiobooks
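
As a rough sketch of how the metadata fields listed above might be used, the snippet below groups utterance ids by speaker. It assumes each record is a dictionary with `speaker_id`, `chapter_id`, and `id` keys; the record values shown are invented placeholders, not rows from the dataset, and `group_ids_by_speaker` is a hypothetical helper name.

```python
from collections import defaultdict
from typing import Dict, Iterable, List


def group_ids_by_speaker(records: Iterable[dict]) -> Dict[int, List[str]]:
    """Map each speaker_id to the list of utterance ids recorded by that speaker."""
    groups: Dict[int, List[str]] = defaultdict(list)
    for record in records:
        groups[record["speaker_id"]].append(record["id"])
    return dict(groups)


# Placeholder records for illustration only; field values are made up.
example_records = [
    {"speaker_id": 1, "chapter_id": 10, "id": "utt-a"},
    {"speaker_id": 2, "chapter_id": 20, "id": "utt-b"},
    {"speaker_id": 1, "chapter_id": 11, "id": "utt-c"},
]

utterances_by_speaker = group_ids_by_speaker(example_records)
```

A grouping like this is a common first step for speaker verification, where trial pairs are drawn from the same speaker or from different speakers.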

Quality Score

D35

Description: 39
Source: 36
Reputation: 33
Access: 22

Community

367 downloads

2 likes

0 views

Dataset Info

Author: AudioLLMs
Created: Jul 15, 2024
Updated: Mar 13, 2025
