Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A filtered version of the Common Voice dataset for automatic speech recognition (ASR). Samples with fewer than three words, repetitive tokens, or chat token leaks have been removed. The dataset was created by OpenSpeechHub and was last updated on March 31, 2026.
License is unknown; users must verify permissible use before downloading.