Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Pidgin ASR Combined is a unified Nigerian Pidgin English speech-to-text dataset created by michaelodafe. It contains approximately 8.6 hours of audio across 4,278 clips from 10 source speakers, formatted as 16 kHz mono WAV files. The dataset was last updated on 2026-05-13 and was used to train a Whisper model that achieved a 21.37% word error rate.
License is unknown; users should verify terms of use before downloading.