Librispeech Synth 300h: Synthetic Speech Audio

Available on 1 platform

Description

Librispeech Synth 300h max 5spks is a speech audio dataset published on Kaggle. The title suggests it contains synthetic speech audio derived from the LibriSpeech corpus, likely comprising up to 300 hours of audio from a maximum of five speakers. The specific source, creation method, and exact content require verification after download.

Use Cases

Training or benchmarking text-to-speech (TTS) systems (inferred from domain, verify after download)
Analyzing the characteristics of synthetic versus natural speech (inferred from domain, verify after download)
Developing voice cloning or speaker adaptation techniques (inferred from domain, verify after download)

Strengths

Published on Kaggle, providing a known platform for distribution.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file size are unknown, so suitability is hard to assess before download.
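
Because the listing gives no file count, size, or format details, a first check after download could be a simple inventory of the extracted files. The sketch below is one way to do that using only the Python standard library. It assumes the dataset has been extracted to a local directory and that at least some of the audio is stored as WAV files, which this listing does not confirm. The function name `inventory` and the returned keys are illustrative, not part of the dataset.

```python
import wave
from collections import Counter
from pathlib import Path


def inventory(root):
    """Summarize files under root: counts per extension, total bytes, WAV hours.

    Assumption: audio may be stored as .wav; other formats are only counted,
    not measured, because the standard library cannot decode them.
    """
    root = Path(root)
    ext_counts = Counter()
    total_bytes = 0
    wav_seconds = 0.0
    unreadable = []

    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        ext = path.suffix.lower()
        ext_counts[ext] += 1
        total_bytes += path.stat().st_size

        if ext == ".wav":
            try:
                with wave.open(str(path), "rb") as wf:
                    # Duration in seconds = frame count / sample rate.
                    wav_seconds += wf.getnframes() / wf.getframerate()
            except (wave.Error, EOFError, ZeroDivisionError):
                # Keep going; report files the wave module could not parse.
                unreadable.append(path)

    return {
        "files_by_extension": dict(ext_counts),
        "total_bytes": total_bytes,
        "wav_hours": wav_seconds / 3600,
        "unreadable_wavs": unreadable,
    }
```

Calling `inventory("path/to/extracted/dataset")` (a placeholder path) returns a dictionary whose `wav_hours` value can be compared against the 300 hours suggested by the title. If the audio turns out to be in another format such as FLAC, the per-extension counts will show that, and a dedicated audio library would be needed to measure duration.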

Related Topics

Audio, Machine Learning, Speech Synthesis, Audio Processing

Related Datasets

Quality Score

D16

Description: 8
Source: 17
Reputation: 18
Access: 31

Community

0 views

Dataset Info

Last synced: May 31, 2026