Librispeech Synth 300h: Synthetic Speech Audio from Up to 10 Speakers | DataSalon

Speech & Audio

Available on 1 platform

Description

Librispeech Synth 300h is a synthetic speech audio dataset derived from the LibriSpeech corpus. The title suggests it contains up to 300 hours of generated audio, likely from a maximum of 10 distinct speaker profiles. It is hosted on the Kaggle platform, but detailed metadata about its creation and contents is not provided.

Use Cases

Training or fine-tuning text-to-speech models (inferred from domain, verify after download)
Benchmarking speech recognition systems on synthetic audio (inferred from domain, verify after download)
Studying speaker characteristics in generated speech (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major platform for data science resources.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Data may reflect bias inherent to its source corpus and synthesis method.
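
Because the 300-hour figure is taken from the title rather than from documented metadata, one way to check it after download is to sum the durations of the audio files. The following is a minimal sketch, not the publisher's tooling. It assumes the audio is stored as uncompressed PCM .wav files, which the listing does not confirm, and the function name summarize_wav_dir is a hypothetical name chosen for illustration. It uses only Python's standard library.

```python
import wave
from pathlib import Path


def summarize_wav_dir(root):
    """Return (file_count, total_hours) for all .wav files under root.

    Assumption: files are PCM WAV, the only format the stdlib
    wave module can read. Other formats need a different reader.
    """
    file_count = 0
    total_seconds = 0.0
    for path in sorted(Path(root).rglob("*.wav")):
        with wave.open(str(path), "rb") as wav:
            # Duration in seconds = frame count / frames per second.
            total_seconds += wav.getnframes() / wav.getframerate()
        file_count += 1
    return file_count, total_seconds / 3600.0
```

Calling summarize_wav_dir on the extracted download folder returns the file count and total hours, which can then be compared against the 300 hours implied by the title. If the files turn out to be FLAC or MP3, this sketch will find nothing or fail to open them, and a third-party audio library would be needed instead.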

Provenance

Source: Derived from the LibriSpeech corpus.
Collection Method: Likely generated via speech synthesis techniques.

Related Topics

Audio, Machine Learning, Audio Dataset, Speech Synthesis, Speech Recognition

Quality Score

D16

Dataset Info

Last synced: May 21, 2026
