10 categories of spoken digits (0-9) provided in an audio format. This dataset serves as an acoustic counterpart to the MNIST handwritten digit collection for speech recognition tasks.
Use Cases
- Train a classification model to identify spoken digits from raw audio waveforms
- Benchmark audio feature extraction techniques like MFCCs against a standardized digit set
- Develop lightweight speech-to-text systems for numeric input validation
Strengths
- 10 distinct classes representing spoken digits from 0 to 9
- Audio-based format designed for speech recognition and signal processing
- Open-source availability for benchmarking acoustic classification models