Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A cleaned, metadata-rich Shona speech dataset prepared through a reproducible data engineering pipeline. The dataset is derived from the google/WaxalNLP source, specifically the sna_asr subset, and was last updated on March 20, 2026. It is intended as a general-purpose standard corpus for downstream tasks.
License is unknown; users must verify permissions before use.