Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Preprocessed text data sourced from Reddit, intended for training or evaluating Automatic Speech Recognition (ASR) systems. The dataset was created by DDSC and last updated on the Hugging Face platform in February 2022. Its size is indicated as between 1 million and 10 million entries.
License terms are unknown and must be verified before use. The specific preprocessing steps applied to the Reddit text are not detailed.