DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Librispeech Synth 300h: Synthetic Speech Audio from LibriSpeech | DataSalon

Home Speech & AudioLibrispeech Synth 300h: Synthetic Speech Audio from LibriSpeech

Speech & Audio

Librispeech Synth 300h: Synthetic Speech Audio from LibriSpeech

Available on 1 platform

Description

Librispeech Synth 300h is a speech synthesis dataset derived from the LibriSpeech corpus. It likely contains up to 300 hours of synthetic audio generated from a maximum of 20 speaker voices. The dataset is hosted on Kaggle.

Use Cases

Training or benchmarking text-to-speech models (inferred from domain, verify after download)
Studying speaker characteristics in synthetic audio (inferred from domain, verify after download)
Evaluating the naturalness of machine-generated speech (inferred from domain, verify after download)

Strengths

Published on Kaggle, a platform for sharing datasets.

Limitations

Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: Derived from the LibriSpeech corpus.
Collection Method: Synthetic generation, likely via text-to-speech systems.

Audio Speech Synthesis Audio Processing Speech Recognition

Related Datasets

Quality Score

D16

Description

Source

Reputation

Quality Score

D16

Description

Source

Reputation

Access

Community

0 views

Dataset Info

Last synced: May 19, 2026

Access

Community

0 views

Dataset Info

Last synced: May 19, 2026

Librispeech Synth 300h: Synthetic Speech Audio from LibriSpeech

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info