LJSpeech: Single Speaker English Speech Dataset

Name: LJSpeech: Single Speaker English Speech Dataset
Creator: flexthink
Published: 2022-03-02T23:29:22
Keywords: Regionus

by flexthinkUpdated 4y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

13,100 short audio clips and corresponding transcriptions featuring a single speaker reading from 7 non-fiction books. The dataset totals approximately 24 hours of audio with individual clip durations ranging from 1 to 10 seconds.

Use Cases

Train neural text-to-speech (TTS) models using the paired audio clips and transcriptions
Benchmark automated speech recognition (ASR) systems on single-speaker clarity using the 1-10 second segments
Analyze prosody and intonation patterns across 24 hours of non-fiction book readings

Strengths

13,100 individual audio clips with a total duration of approximately 24 hours
Audio segments strictly constrained to lengths between 1 and 10 seconds
Verbatim transcriptions provided for every audio segment in the collection

Regionus

Related Datasets

Quality Score

D28

Description

30

Source

36

Reputation

15

Access

22

Community

141 downloads

4 likes

0 views

Dataset Info

Author: flexthink
Created: Mar 2, 2022
Updated: Feb 6, 2022
Last synced: Apr 29, 2026

Access

22

Community

141 downloads

4 likes

0 views

Dataset Info

Author: flexthink
Created: Mar 2, 2022
Updated: Feb 6, 2022
Last synced: Apr 29, 2026

LJSpeech: Single Speaker English Speech Dataset

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info