This dataset contains 32,901 paired Amharic speech audio files and transcriptions, processed from the BDU-speech dataset by Yohannes A. Ejigu. Updated in March 2026, the collection provides mono audio recordings structured for automatic speech recognition (ASR) research and model training.
Audio files are decoded as mono, but sampling rates vary across the dataset. Resampling every clip to a single consistent rate (e.g., 16 kHz) is recommended before training.
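The resampling step could look roughly like the sketch below. It assumes each clip is already available as a 1-D NumPy array together with its original sampling rate. How the clips are loaded is not specified here, so a synthetic tone stands in for a real recording. The helper name `resample_to_target` and the constant `TARGET_SR` are hypothetical, and polyphase resampling via SciPy is one reasonable choice, not necessarily the method used to prepare this dataset.

```python
from math import gcd

import numpy as np
from scipy.signal import resample_poly

# Assumed target rate, taken from the "e.g., 16 kHz" suggestion above.
TARGET_SR = 16_000


def resample_to_target(samples: np.ndarray, source_sr: int,
                       target_sr: int = TARGET_SR) -> np.ndarray:
    """Resample a mono 1-D waveform to target_sr (hypothetical helper)."""
    samples = samples.astype(np.float32)
    if source_sr == target_sr:
        # Already at the target rate; nothing to do.
        return samples
    # Reduce the rate ratio to the smallest integer up/down factors.
    g = gcd(source_sr, target_sr)
    up, down = target_sr // g, source_sr // g
    # Polyphase filtering applies an anti-aliasing low-pass filter internally.
    return resample_poly(samples, up, down).astype(np.float32)


# Synthetic stand-in for one clip: 1 second of a 440 Hz tone at 44.1 kHz.
source_sr = 44_100
t = np.arange(source_sr) / source_sr
clip = np.sin(2 * np.pi * 440 * t)

resampled = resample_to_target(clip, source_sr)
print(resampled.shape, resampled.dtype)
```

Applying the same function to every clip, with each clip's own `source_sr`, would give the model a uniform input rate regardless of how the original recordings were captured.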