Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Between 10,000 and 100,000 audio clips and transcriptions of Tashkent dialect Uzbek speech comprise this dataset. Collected by islomov from YouTube podcasts like Jahongir Latipov and Bu podcast, it was last updated in June 2025 for Automatic Speech Recognition (ASR) tasks.
Distributed in Parquet format; requires tools like Polars, Dask, or Hugging Face Datasets for efficient loading.