Open Slr108 Turkish 10 Hours

Name: Open Slr108 Turkish 10 Hours
Creator: emre
Published: 2022-03-02T23:29:22
Keywords: arXiv:2103.16193, license: CC BY 4.0, region: US, robust-speech-event

Description

10 hours of Turkish media speech audio clips designed for evaluating automatic speech recognition (ASR) systems. This dataset is part of the MediaSpeech collection, which also covers French, Arabic, and Spanish.

Use Cases

Benchmark the word error rate (WER) of Turkish ASR models using the provided media speech audio
Fine-tune speech-to-text systems on Turkish broadcast media characteristics
Conduct cross-linguistic ASR performance comparisons by combining this data with the French, Arabic, and Spanish subsets of SLR108
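The WER benchmark mentioned above can be sketched as follows. This is a minimal, dependency-free illustration, not the evaluation script used by the dataset authors: the function names `edit_distance` and `corpus_wer` are hypothetical, and it assumes reference and hypothesis transcripts are already normalized (casing, punctuation) and whitespace-tokenized by the caller.

```python
from typing import Iterable, List, Tuple


def edit_distance(ref: List[str], hyp: List[str]) -> int:
    """Word-level Levenshtein distance (substitutions, insertions, deletions)."""
    # prev[j] holds the distance between the reference prefix seen so far and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[-1]


def corpus_wer(pairs: Iterable[Tuple[str, str]]) -> float:
    """Corpus-level WER: total word edits divided by total reference words.

    Assumption: text normalization is left to the caller. Note that
    Python's str.lower() does not apply Turkish dotted/dotless i rules.
    """
    total_edits = 0
    total_ref_words = 0
    for reference, hypothesis in pairs:
        ref_words = reference.split()
        total_edits += edit_distance(ref_words, hypothesis.split())
        total_ref_words += len(ref_words)
    if total_ref_words == 0:
        raise ValueError("references contain no words")
    return total_edits / total_ref_words


if __name__ == "__main__":
    # Made-up (reference, hypothesis) pairs, purely for illustration.
    sample = [
        ("bugün hava çok güzel", "bugün hava güzel"),
        ("haberler başlıyor", "haberler başlıyor"),
    ]
    print(f"WER: {corpus_wer(sample):.3f}")
```

Summing edits over the whole set before dividing weights long and short clips by their word counts, which is the usual convention for reporting a single WER figure. Running the same function separately on each language subset gives comparable per-language numbers.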

Strengths

10 hours of audio recordings specifically in the Turkish language
Part of the SLR108 MediaSpeech collection covering four major languages
Distributed under the Creative Commons Attribution 4.0 International License
Composed of short speech segments extracted from media sources

Quality Score

Overall: D34
Description: 48
Source: 36
Reputation: 12
Access: 22

Community

29 downloads

4 likes

0 views

Dataset Info

Author: emre
Created: Mar 2, 2022
Updated: Dec 6, 2022
Last synced: Apr 14, 2026
