Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Encompassing between 10,000 and 100,000 audio clips and transcriptions in the Uzbek language, specifically targeting the Information Technology domain. Collected by islomov from YouTube channels like Mohir Dev and updated in June 2025, it includes English technical terms to improve model generalization. The data is designed for training and evaluating Automatic Speech Recognition (ASR) systems in a technical context.
The dataset is provided in Parquet format and is licensed under Apache 2.0, allowing for broad use in research and commercial applications.