Youtube Video Transkriptleri Tr

Name: Youtube Video Transkriptleri Tr
Creator: Anilosan15
Published: 2025-03-02T07:03:28
Keywords: Librarypolars, Librarydask, Modalityaudio, Size Categoriesn1 K, Modalitytext, Librarymlcroissant, Librarydatasets, Licensecc By 40, Parquet, Languagetr, Regionus, Task Categoriesautomatic Speech Recognition

by Anilosan15Updated 1y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

5 hours of Turkish audio and text transcripts sourced from over 40 Creative Commons-licensed YouTube videos. The collection features more than 100 distinct speakers with audio resampled to 16 kHz and segmented into clips of up to 25 seconds. It is specifically designed for training and evaluating Turkish speech-to-text models.

Use Cases

Train Turkish speech-to-text (STT) models using the audio clips and their corresponding transcriptions.
Develop speaker recognition or diarization systems leveraging the 100+ unique voices present in the recordings.
Fine-tune acoustic models for Turkish language processing using the 16 kHz resampled audio files.

Strengths

Contains approximately 5 hours of Turkish speech data.
Features audio from over 100 different speakers to ensure vocal variety.
Audio files are standardized at a 16 kHz sampling rate.
Data is segmented into manageable chunks with a maximum duration of 25 seconds.

Parquet Librarypolars Librarydask Modalityaudio Size Categoriesn1 K Modalitytext Librarymlcroissant Librarydatasets Licensecc By 40 Languagetr Regionus Task Categoriesautomatic Speech Recognition

Related Datasets

Quality Score

D37

Description

48

Source

36

Reputation

28

Access

22

Community

17 downloads

2 likes

0 views

Dataset Info

Author: Anilosan15
Created: Mar 2, 2025
Updated: Mar 2, 2025
Last synced: Apr 29, 2026

Access

22

Community

17 downloads

2 likes

0 views

Dataset Info

Author: Anilosan15
Created: Mar 2, 2025
Updated: Mar 2, 2025
Last synced: Apr 29, 2026

Youtube Video Transkriptleri Tr

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info