Loading...
Loading...
Available on 2 platforms
Sign in to view source links and access this dataset
Lwazi Afrikaans ASR corpus provides matched audio recordings and orthographic transcriptions designed for speech recognition systems. Audio files are telephone-quality, recorded at 8 KHz, 16-bit, and single-channel, with each utterance stored in a separate text file. This dataset was created to support the development of Automatic Speech Recognition (ASR) for the Afrikaans language.
The listed 'last updated' date is in the future (2026), which may indicate incorrect metadata. Audio is telephone-quality, which may limit applicability for high-fidelity speech tasks.