Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Voices in the Wild 2M is an automatic speech recognition dataset designed for robustness training and evaluation. The dataset contains audio files grouped by normalized acoustic subset, with fields for file paths and reference transcriptions. It was created by author zhifeixie and last updated on Hugging Face in May 2026.
License is unknown, which may restrict commercial or research use.