Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Egyptian Arabic STT Dataset is a synthetic speech dataset containing 50 samples totaling 85.2 seconds of audio. The samples were generated by the Synthetic Egyptian Speech Data Pipeline and have been human-reviewed and quality-validated using Whisper ASR, achieving an average WER of 0.4136 and CER of 0.1642. The dataset focuses on the topic of food ordering.
License is unknown and must be checked before use.