Uzbek Speech Corpus

Name: Uzbek Speech Corpus
Creator: issai
Published: 2025-01-17T13:06:08
Keywords: Languageuz, Modalityaudio, Audio, Regionus, Task Categoriesautomatic Speech Recognition, Licensemit

by issaiUpdated 1y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

105 hours of manually checked Uzbek speech recordings featuring 958 unique speakers. The dataset includes transcribed audio files designed for speech recognition tasks in the Uzbek language.

Use Cases

Train automatic speech recognition (ASR) models using the transcribed audio recordings
Perform speaker identification tasks using the 958 unique speakers
Conduct linguistic analysis of the Uzbek language using the manually verified transcriptions

Strengths

105 hours of transcribed audio recordings
958 unique speakers represented in the corpus
Manually checked transcriptions to ensure high data quality

Audio Languageuz Modalityaudio Regionus Task Categoriesautomatic Speech Recognition Licensemit

Related Datasets

Quality Score

D38

Description

48

Source

36

Reputation

32

Access

22

Community

65 downloads

5 likes

0 views

Dataset Info

Author: issai
Created: Jan 17, 2025
Updated: Feb 13, 2025

Access

22

Community

65 downloads

5 likes

0 views

Dataset Info

Author: issai
Created: Jan 17, 2025
Updated: Feb 13, 2025

Uzbek Speech Corpus

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info