Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Annotated Russian audio data for tasks like text-to-speech, speech recognition, and speaker identification. The dataset includes features such as text transcriptions, speaker identifiers, audio files, utterance pitch statistics, and signal-to-noise ratio (SNR). It was created by author kijjjj and last updated in June 2025.
The full description, including details on the snr feature, is available only on the Hugging Face dataset page. The dataset uses the Parquet format and is compatible with libraries like Polars, Dask, and mlcroissant.