Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
South African English audio recordings and transcriptions for developing Lwazi speech recognition systems. The corpus contains telephone-quality audio files at 8 KHz, 16-bit, mono channel, with corresponding orthographic transcriptions in Unicode text format. Researcher Jaco Badenhorst created this dataset, which was last updated in April 2026.
Dataset access requires visiting the provided handle.net URL; direct download links or file counts are not specified in the description. The 0.0 B size note indicates a very small or metadata-only listing.