LibriSpeech Augmented with MUSAN Noise and Music Samples

Available on 1 platform

Sign in to view source links and access this dataset

Description

A speech audio dataset combining the LibriSpeech corpus with MUSAN augmentation data. The dataset is published on Kaggle, but specific details on size, creation date, and author are not provided in the metadata. Its content likely contains speech recordings augmented with noise and music samples for machine learning training.

Use Cases

Training robust automatic speech recognition (ASR) systems with noise augmentation (inferred from domain, verify after download)
Benchmarking audio data augmentation techniques (inferred from domain, verify after download)
Developing models for speech activity detection in noisy environments (inferred from domain, verify after download)

Strengths

Published on Kaggle, a major platform for data science resources.
Combines two established resources (LibriSpeech and MUSAN) for audio augmentation.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, file formats, and license information are unknown.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: Kaggle
Collection Method: Likely a combination of the LibriSpeech corpus and MUSAN augmentation files.
Time Range: null
Freshness: Last update date is unknown; freshness unverified.
Geography: null

null

Audio Machine Learning Augmentation

Related Datasets

Quality Score

D16

Description

8

Source

17

Reputation

18

Access

31

Community

0 views

Dataset Info

Last synced: Apr 24, 2026

Access

31

Community

0 views

Dataset Info

Last synced: Apr 24, 2026

LibriSpeech Augmented with MUSAN Noise and Music Samples

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info