Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
32,267 audio samples totaling 103.18 hours of Vietnamese speech, curated for automatic speech recognition. The dataset, created by thanhnew2001, was last updated in February 2026. It is structured into 29,041 training and 3,226 development samples.
Data is formatted in the Icefall framework format (train.json, dev.json). Users must refer to the Hugging Face dataset page for the full description and access instructions.