A speech audio dataset that combines the LibriSpeech corpus with MUSAN augmentation data. It is published on Kaggle, but the metadata does not state its size, creation date, or author. It likely contains speech recordings augmented with noise and music samples for machine learning training.
Use Cases
- Training robust automatic speech recognition (ASR) systems with noise augmentation (inferred from domain, verify after download)
- Benchmarking audio data augmentation techniques (inferred from domain, verify after download)
- Developing models for speech activity detection in noisy environments (inferred from domain, verify after download)
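The noise-augmentation use case above typically means mixing a clean utterance with a noise clip at a chosen signal-to-noise ratio (SNR). The sketch below illustrates that idea with the standard library only. It is not taken from this dataset: the function names `rms` and `mix_at_snr` are hypothetical, samples are assumed to be already decoded into lists of floats, and tiling the noise to the speech length is one assumption among several common choices.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty sequence of float samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def mix_at_snr(speech, noise, snr_db):
    """Return speech plus noise, with the noise scaled to the requested SNR in dB.

    Hypothetical helper: assumes both inputs share one sample rate and are
    plain lists of floats (for example, decoded from audio files beforehand).
    """
    if not speech or not noise:
        raise ValueError("speech and noise must be non-empty")

    # Tile the noise so it covers the whole utterance, then trim to length.
    repeats = -(-len(speech) // len(noise))  # ceiling division
    noise = (list(noise) * repeats)[: len(speech)]

    noise_rms = rms(noise)
    if noise_rms == 0:
        # Silent noise clip: nothing to add.
        return list(speech)

    # SNR_dB = 20 * log10(rms_speech / rms_noise), solved for the noise gain.
    target_noise_rms = rms(speech) / (10 ** (snr_db / 20))
    gain = target_noise_rms / noise_rms
    return [s + gain * n for s, n in zip(speech, noise)]
```

Lower `snr_db` values produce noisier mixtures. Whether this dataset ships pre-mixed audio or separate speech and noise files is unknown until the download is inspected.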
Strengths
- Published on Kaggle, a major platform for data science resources.
- Combines two established resources (LibriSpeech and MUSAN) for audio augmentation.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, file formats, and license information are unknown.
- Column-level documentation is absent; field semantics must be inferred after download.
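Since file formats and counts are unknown, a first verification pass after download could simply inventory the extracted directory. The following is a minimal sketch under stated assumptions: the helper names `count_extensions` and `wav_summary` are hypothetical, and the directory layout is not known in advance. The standard-library `wave` module reads only uncompressed PCM WAV files, so other formats (the original LibriSpeech release, for instance, is distributed as FLAC) would need a different reader.

```python
import wave
from collections import Counter
from pathlib import Path


def count_extensions(root):
    """Count files under root by lowercase extension ('<none>' if missing)."""
    return Counter(
        p.suffix.lower() or "<none>"
        for p in Path(root).rglob("*")
        if p.is_file()
    )


def wav_summary(path):
    """Read basic header fields from a PCM WAV file."""
    with wave.open(str(path), "rb") as wf:
        frames = wf.getnframes()
        rate = wf.getframerate()
        return {
            "channels": wf.getnchannels(),
            "sample_rate": rate,
            "sample_width_bytes": wf.getsampwidth(),
            "duration_s": frames / rate,
        }
```

Running `count_extensions` on the extracted folder shows which formats are present, and `wav_summary` on a few sample files confirms sample rate and duration before any training code depends on them.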
Provenance
- Source: Kaggle
- Collection Method: Likely a combination of the LibriSpeech corpus and MUSAN augmentation files.
- Time Range: Not specified
- Freshness: Last update date is unknown; freshness unverified.
- Geography: Not specified