Urdu-language audio recordings and text transcriptions are provided for automatic speech recognition tasks. The collection features audio files segmented by silence intervals to support the training and fine-tuning of speech models.
Use Cases
- Train automatic speech recognition (ASR) models using the audio recordings and transcription data
- Fine-tune pre-trained speech models on Urdu phonetics using the silence-segmented audio files
- Evaluate speech-to-text model performance by comparing predictions against the provided transcriptions
Strengths
- Urdu language audio files paired with corresponding text transcriptions
- Audio processing includes segmentation based on silence intervals
- Version 1.1 release curated for automatic speech recognition (ASR) tasks