Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
SOREVA is a multilingual speech dataset designed for evaluating text-to-speech and speech representation models. It contains approximately 150 audio and transcription samples for each of 49 African languages and dialects. The dataset was created by OlameMend and last updated in December 2025.
License information is not provided.