Librispeech Synth 300h: Synthetic Speech Audio from Up to 20 Speakers | DataSalon

Speech & Audio

Available on 1 platform

Description

Librispeech Synth 300h max 20spks is an audio dataset published on Kaggle. The title suggests it contains up to 300 hours of synthetic speech audio, likely generated from the LibriSpeech corpus, featuring a maximum of 20 distinct speakers. Its specific creation method and exact content require verification after download.

Use Cases

Training text-to-speech synthesis models (inferred from domain, verify after download)
Benchmarking speech recognition systems on synthetic audio (inferred from domain, verify after download)
Studying speaker characteristics in synthesized speech (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major data science platform.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, file formats, and column-level documentation are unknown.
Data may reflect bias inherent to the source corpus used for synthesis.
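
Because the listing leaves file formats and counts unknown, a quick inventory after download can confirm the basics. The following is a minimal sketch, assuming the archive has been extracted to a local folder and that at least some of the audio is PCM WAV (the listing confirms neither). The function name `inventory_audio` and the folder path are hypothetical.

```python
import wave
from collections import Counter
from pathlib import Path


def inventory_audio(root):
    """Count files by extension and total the duration of readable WAV files.

    Returns (counts_by_extension, total_wav_hours).
    """
    counts = Counter()
    wav_seconds = 0.0
    for path in Path(root).rglob("*"):
        if not path.is_file():
            continue
        ext = path.suffix.lower() or "<none>"
        counts[ext] += 1
        if ext == ".wav":
            try:
                with wave.open(str(path), "rb") as wav:
                    wav_seconds += wav.getnframes() / wav.getframerate()
            except (wave.Error, EOFError, ZeroDivisionError):
                # Not PCM WAV, truncated, or malformed header: skip it
                # rather than guess at its duration.
                pass
    return dict(counts), wav_seconds / 3600.0


if __name__ == "__main__":
    # Hypothetical extraction folder; replace with the real download path.
    counts, hours = inventory_audio("librispeech_synth_300h")
    print(counts)
    print(f"{hours:.1f} hours of readable WAV audio")
```

Comparing the reported hours against the 300 hours implied by the title is one way to check the description. Files in other formats (for example FLAC or MP3) are counted but not timed, since the standard library cannot decode them.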

Related Topics

Audio, Machine Learning, Audio Dataset, Speech Synthesis, Speech Recognition

Quality Score

D16

Description

Source

Reputation

Access

Community

0 views

Dataset Info

Last synced: May 19, 2026
