Sign in to view source links and access this dataset
Description
A dataset for training Text-to-Speech models, including XTTS_v2, YourTTS, and Tacotron. It contains audio in the LJSpeech format, featuring multiple speakers of the Saudi dialect of Arabic. The dataset was created by Abdelrahman2922 and was last updated on March 30, 2026.
Use Cases
Training multi-speaker TTS models based on the described inclusion of male and female voices.
Fine-tuning XTTS_v2 models based on the dataset's stated design purpose.
Developing dialect-specific Arabic speech synthesis based on the focus on the Saudi dialect.
Creating audio samples for speech synthesis evaluation based on the provided WAV files.
Strengths
Audio is provided in a standard format for TTS research, with a recommended sample rate of 22050 Hz.
Includes multiple speakers, which likely supports training more diverse voice models.
Specifically targets the Saudi dialect of Arabic, addressing a specific linguistic niche.
Limitations
Description metadata is limited; actual data quality, speaker count, and audio length require manual inspection after download.
Row count and total dataset size are unknown, which may limit suitability assessment for large-scale training.
Column-level documentation is absent; field semantics for any associated metadata must be inferred after download.
Provenance
Source
huggingface
Freshness
Last updated 2026-03-30 15:18:56; freshness should be verified.
Geography
Saudi Arabia (implied by dialect focus)
License is unknown; users must verify terms of use before application.