Available on 1 platform
This corpus provides 156 hours of high-fidelity audio to address the under-resourcing of Urdu in speech technology. It contains 71,792 diarized utterances across three specialized subsets: Standard Pakistani Urdu, Urdu-English Code-Switched, and Pakistani-Accented English. It was created by ASLP-lab and last updated in June 2026.
License information is unknown and should be confirmed before use.
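
As a rough sanity check on the figures quoted above, the total duration and utterance count imply an average utterance length of a little under 8 seconds. The sketch below works that out; it assumes the 156 hours consist entirely of the 71,792 utterances (no untranscribed or discarded audio), which the listing does not confirm. The function name `mean_utterance_seconds` is illustrative and not part of any dataset tooling.

```python
# Figures taken from the dataset description.
TOTAL_HOURS = 156
UTTERANCES = 71_792


def mean_utterance_seconds(total_hours: float, utterances: int) -> float:
    """Average utterance length in seconds.

    Assumes the stated hours are made up only of the counted utterances.
    """
    return total_hours * 3600 / utterances


mean_seconds = mean_utterance_seconds(TOTAL_HOURS, UTTERANCES)
print(f"{mean_seconds:.2f} s per utterance on average")
```

The per-subset split is not given in the listing, so this average says nothing about how utterance length differs between the three subsets.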