Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Featuring high-quality conversational audio samples for Automatic Speech Recognition tasks in Vietnamese, Korean, Arabic, and Filipino. It includes paired audio and transcripts of natural, non-scripted speech, featuring both single-speaker and dual-speaker interactions. Audio specifications include a sampling rate of 16 kHz to 24 kHz and a 16-bit bit depth.
The full description and detailed specifications are available only on the linked Hugging Face dataset page. License information is unknown.