Dolly-Audio contains 1,000 hours of professionally cleaned Vietnamese speech audio featuring 152 speakers from various regions. Created by the Dolly AI Team and updated in December 2024, the corpus is designed to support speech synthesis and recognition research. It includes both audio recordings and corresponding text transcripts across multiple Vietnamese dialects.
The dataset is distributed in Parquet format. Libraries such as Hugging Face Datasets, Polars, or Dask can read it efficiently, including lazily or in streaming mode, so the full corpus does not have to fit in memory.
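As a rough illustration of the loading advice above, the sketch below streams local Parquet shards with Hugging Face Datasets. It assumes the shards have already been downloaded to a local directory. The directory layout and the function names `list_parquet_shards` and `load_streaming` are hypothetical, and the column names are not documented here, so inspect them before relying on any field.

```python
from pathlib import Path


def list_parquet_shards(data_dir):
    """Return every .parquet file under data_dir, sorted for a stable order."""
    return sorted(str(p) for p in Path(data_dir).rglob("*.parquet"))


def load_streaming(data_dir):
    """Open local Parquet shards as a streaming dataset.

    Assumption: the shards live under data_dir. The actual file layout of
    the corpus is not described in the dataset card.
    """
    shards = list_parquet_shards(data_dir)
    if not shards:
        raise FileNotFoundError(f"no .parquet files found under {data_dir}")

    # Imported lazily so list_parquet_shards works without the library installed.
    from datasets import load_dataset

    # streaming=True yields examples one at a time instead of
    # materializing the whole corpus in memory.
    return load_dataset("parquet", data_files=shards, split="train", streaming=True)
```

A typical first step would be `ds = load_streaming("./dolly-audio")` followed by `next(iter(ds)).keys()` to discover the available columns. The path `./dolly-audio` is a placeholder, not the dataset's real location.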