Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
NCHLT Speech Corpus Xhosa contains audio recordings of the Xhosa language, a major South African language. The dataset was created by Beijuka and uploaded to Hugging Face in June 2024. It is part of the National Centre for Human Language Technology (NCHLT) initiative.
Data is stored in Parquet format; requires compatible libraries like polars or dask for efficient loading.