Arabic Saudi Multi-Speaker TTS Dataset in LJSpeech Format

Name: Arabic Saudi Multi-Speaker TTS Dataset in LJSpeech Format
Creator: Abdelrahman2922
Published: 2026-03-30T14:20:11
Keywords: Text To Speech, Saudi Dialect, Arabic Language, Multi Speaker, Audio, Audio Synthesis

by Abdelrahman2922Updated 3mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

A dataset for training Text-to-Speech models, including XTTS_v2, YourTTS, and Tacotron. It contains audio in the LJSpeech format, featuring multiple speakers of the Saudi dialect of Arabic. The dataset was created by Abdelrahman2922 and was last updated on March 30, 2026.

Use Cases

Training multi-speaker TTS models based on the described inclusion of male and female voices.
Fine-tuning XTTS_v2 models based on the dataset's stated design purpose.
Developing dialect-specific Arabic speech synthesis based on the focus on the Saudi dialect.
Creating audio samples for speech synthesis evaluation based on the provided WAV files.

Strengths

Audio is provided in a standard format for TTS research, with a recommended sample rate of 22050 Hz.
Includes multiple speakers, which likely supports training more diverse voice models.
Specifically targets the Saudi dialect of Arabic, addressing a specific linguistic niche.

Limitations

Description metadata is limited; actual data quality, speaker count, and audio length require manual inspection after download.
Row count and total dataset size are unknown, which may limit suitability assessment for large-scale training.
Column-level documentation is absent; field semantics for any associated metadata must be inferred after download.

Provenance

Source: huggingface
Freshness: Last updated 2026-03-30 15:18:56; freshness should be verified.
Geography: Saudi Arabia (implied by dialect focus)

License is unknown; users must verify terms of use before application.

Audio Text To Speech Saudi Dialect Arabic Language Multi Speaker Audio Synthesis

Related Datasets

Quality Score

D38

Description

42

Source

36

Reputation

41

Access

26

Community

50 downloads

1 likes

0 views

Dataset Info

Author: Abdelrahman2922
Created: Mar 30, 2026
Updated: Mar 30, 2026
Last synced: May 8, 2026

Access

26

Community

50 downloads

1 likes

0 views

Dataset Info

Author: Abdelrahman2922
Created: Mar 30, 2026
Updated: Mar 30, 2026
Last synced: May 8, 2026

Arabic Saudi Multi-Speaker TTS Dataset in LJSpeech Format

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info