Egyptian Text-Audio Dataset for TTS Model Training

Name: Egyptian Text-Audio Dataset for TTS Model Training
Creator: OmarAhmedSobhy
Published: 2026-04-25T15:03:08
Keywords: Text To Speech, Egyptian-Arabic, Text, Audio, Forced Alignment, Audio Processing

by OmarAhmedSobhyUpdated 2mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

An automated pipeline for collecting Egyptian Arabic text-audio pairs from YouTube videos. The dataset is created by OmarAhmedSobhy and was last updated on 2026-04-25. It uses forced alignment and automatic speech recognition models to process the audio and text.

Use Cases

Train text-to-speech models based on Egyptian Arabic audio-text pairs.
Fine-tune forced alignment models based on the described processing pipeline.
Benchmark automatic speech recognition performance on Egyptian Arabic dialects.
Develop pronunciation dictionaries for Egyptian Arabic based on aligned audio segments.

Strengths

The pipeline is automated, suggesting potential for scalable data collection.
Utilizes specific, named models (mms-300m-1130-forced-aligner, Cohere ASR) for processing.
Focuses on Egyptian Arabic, a specific dialect not covered by many general TTS datasets.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Row count, file formats, and column-level documentation are absent.
Data may reflect geographic and source bias inherent to YouTube content.

Provenance

Source: YouTube videos, processed via an automated pipeline.
Collection Method: Automated downloading, audio processing, and forced alignment using specified models.
Time Range: null
Freshness: Last updated 2026-04-25 16:24:38; freshness should be verified.
Geography: Egyptian Arabic dialect focus.

License is unknown; source code for the collection pipeline is referenced but not included in the dataset listing.

Text Audio Text To Speech Egyptian-Arabic Forced Alignment Audio Processing

Related Datasets

Quality Score

D32

Description

27

Source

36

Reputation

40

Access

26

Community

23 downloads

1 likes

0 views

Dataset Info

Author: OmarAhmedSobhy
Created: Apr 25, 2026
Updated: Apr 25, 2026
Last synced: May 7, 2026

Access

26

Community

23 downloads

1 likes

0 views

Dataset Info

Author: OmarAhmedSobhy
Created: Apr 25, 2026
Updated: Apr 25, 2026
Last synced: May 7, 2026

Egyptian Text-Audio Dataset for TTS Model Training

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info