DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Voices In The Wild 2M | DataSalon

Home Speech & AudioVoices In The Wild 2M

Speech & Audio

Voices In The Wild 2M

Name: Voices In The Wild 2M
Creator: zhifeixie
Published: 2026-05-18T17:44:26
Keywords: Benchmark, Audio, Robustness, Audio Transcription, Time Series, Speech Recognition, Acoustic Conditions

by zhifeixie·Updated 2mo ago

Available on 1 platform

Description

Voices in the Wild 2M is an automatic speech recognition dataset designed for robustness training and evaluation. The dataset contains audio files grouped by normalized acoustic subset, with fields for file paths and reference transcriptions. It was created by author zhifeixie and last updated on Hugging Face in May 2026.

Use Cases

Train speech recognition models for robustness based on diverse acoustic conditions mentioned in the description.
Evaluate model performance across different acoustic subsets based on the dataset's grouping.
Benchmark transcription accuracy using the provided reference transcriptions (answer field).

Strengths

Dataset is explicitly designed for robustness training and evaluation under diverse acoustic conditions.
Audio files are grouped by normalized acoustic subset, which may facilitate targeted analysis.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: Hugging Face, author zhifeixie.
Collection Method: Likely contains automatically gathered or processed audio and transcription data.
Time Range: null
Freshness: Last updated 2026-05-19 08:51:14; freshness should be verified.
Geography: null

License is unknown, which may restrict commercial or research use.

Audio Time Series Benchmark Robustness Audio Transcription Speech Recognition Acoustic Conditions

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

9 downloads

2 likes

0 views

Dataset Info

Author: zhifeixie
Created: May 18, 2026
Updated: May 19, 2026
Last synced: Jul 21, 2026

Access

Community

9 downloads

2 likes

0 views

Dataset Info

Author: zhifeixie
Created: May 18, 2026
Updated: May 19, 2026
Last synced: Jul 21, 2026

Voices In The Wild 2M

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info