Between 100,000 and 1,000,000 Uzbek-language audio segments with transcriptions, sourced from YouTube by openbank-uz in early 2026. The collection uses vocal isolation to separate speakers and Google's Gemini 2.0 Flash model for automated transcription.
Distributed under the Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0) license. The data is provided as Arrow-format files, so users should be prepared to process that format.