This dataset aggregates between 1,000 and 10,000 manually aligned audio-text pairs from Kazakh commercial songs. It was released by yeshpanovrustem in 2026. It provides line-level vocal segments designed to investigate how useful sung speech is for low-resource automatic speech recognition (ASR) systems.
The dataset is provided in Parquet format and is compatible with Polars, Dask, and the Hugging Face datasets library. Users should be aware of potential license restrictions related to the use of commercial music recordings.
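Since the data ships as Parquet, a local copy can be read with Polars in a few lines. The following is a minimal sketch, not the publisher's own loading code: the function names `find_parquet_shards` and `load_segments` are hypothetical, the directory layout is assumed, and the column names are not documented here, so the schema should be inspected before relying on any field.

```python
from pathlib import Path


def find_parquet_shards(root):
    """Return every .parquet file under `root`, sorted for a stable order."""
    return sorted(Path(root).rglob("*.parquet"))


def load_segments(root):
    """Read all Parquet shards under `root` into one Polars DataFrame."""
    # Imported here so the shard-listing helper works without Polars installed.
    import polars as pl

    shards = find_parquet_shards(root)
    if not shards:
        raise FileNotFoundError(f"no .parquet files found under {root}")
    # Assumption: all shards share one schema, so a vertical concat is valid.
    return pl.concat([pl.read_parquet(path) for path in shards])
```

Calling `load_segments("data/")` on a downloaded copy (where `data/` is a placeholder path) and printing `df.schema` is a reasonable first step to see which columns hold the audio and the aligned text.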