Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
English Wikipedia text and ASR error data presented in an ASRU-2023 paper. It contains 4.3 million unique words or phrases from Wikipedia titles occurring in 33.8 million paragraphs, plus 26 million phrase pairs representing ASR recognition errors. The dataset was created by bene-ges and last updated on Hugging Face in December 2023.
License is unknown; users should verify terms of use before applying the data.