Over 114 hours of high-quality Persian audio sampled at 44.1 kHz, released under the CC-0 license. Collected from Nasl-e-Mana magazine, the dataset covers a diverse range of topics. It was created by MahtaFetrat and last updated on July 12, 2025.
Use Cases
- Train text-to-speech models based on the high-quality, single-speaker audio.
- Fine-tune speech synthesis systems for Persian based on the large volume of speech data.
- Develop educational or commercial voice applications based on the permissive CC-0 license.
- Research prosody and speech patterns in Persian based on the diverse topics covered.
Strengths
- Over 114 hours of audio provides substantial training material.
- High-quality audio sampled at 44.1 kHz suggests good fidelity.
- Permissive CC-0 license allows for unrestricted educational and commercial use.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count and file formats are unknown, which may limit suitability assessment.
Provenance
- Source
- Nasl-e-Mana magazine
- Collection Method
- Collected from magazine content.
- Freshness
- Last updated 2025-07-12 12:32:59; freshness should be verified.
- Geography
- Persian language content.