Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Giving access to segmented audio files and their transcriptions sourced from Emirati TV shows, podcasts, and YouTube channels. It is designed as a benchmark for Automatic Speech Recognition models for the Emirati dialect, covering categories like traditions, cars, health, games, sports, and police. The dataset was created by eabayed and last updated in May 2022.
The dataset consists of a zipped audio file and a TSV transcription file; specific tools for audio processing may be required. License tags indicate 'afl 30', suggesting an Academic Free License, but the exact terms should be verified.