A curated dataset of voice samples for Text-to-Speech (TTS) voice cloning. It contains high-quality audio clips with corresponding metadata. The dataset was created by sdialog and last updated on December 5, 2025.
Use Cases
- Train voice cloning models based on high-quality audio clips.
- Fine-tune Text-to-Speech systems using curated voice samples.
- Develop synthetic speech applications leveraging the provided audio data.
Strengths
- Audio clips are curated for quality, which makes them suitable for model training.
- Includes a metadata file (metadata.csv) for organizing audio samples.
Limitations
- Row count and total dataset size are unknown, so suitability for a given task is hard to assess in advance.
- Column-level documentation is absent; field semantics must be inferred after download.
- The last recorded update was on 2025-12-05; check for newer revisions before use.
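Because row count and column semantics are undocumented, a first pass over metadata.csv after download can fill in some of the gaps. The following is a minimal sketch using only the Python standard library. The function name `summarize_metadata` and the sample columns (`file_name`, `speaker`, `text`) are hypothetical placeholders; the real column names are not known from this description.

```python
import csv
import io
from collections import Counter


def summarize_metadata(csv_file):
    """Return row count, column names, and per-column empty-value counts."""
    reader = csv.DictReader(csv_file)
    columns = list(reader.fieldnames or [])
    row_count = 0
    empty_counts = Counter()
    for row in reader:
        row_count += 1
        for name in columns:
            # Treat missing fields and whitespace-only fields as empty.
            if not (row.get(name) or "").strip():
                empty_counts[name] += 1
    return {
        "rows": row_count,
        "columns": columns,
        "empty_values": {name: empty_counts[name] for name in columns},
    }


# Hypothetical stand-in for metadata.csv; the real columns are an assumption.
sample = io.StringIO(
    "file_name,speaker,text\n"
    "clip_001.wav,spk_a,Hello there\n"
    "clip_002.wav,,Good morning\n"
)
summary = summarize_metadata(sample)
print(summary)
```

For the real file, pass an open file handle instead of the in-memory sample, for example one opened with `open(path, newline="", encoding="utf-8")`, where `path` is wherever the download placed metadata.csv.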
Provenance
- Source: Hugging Face
- Freshness: last updated 2025-12-05 02:04:43
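One way to act on the freshness note is to compute how old the recorded update is on the day the dataset is evaluated. This is a small sketch, not part of the dataset's tooling. It assumes the timestamp is in UTC, which the source does not state, and the helper name `age_in_days` and the reference date are illustrative.

```python
from datetime import datetime, timezone

# Timestamp from the Provenance entry; UTC is an assumption.
LAST_UPDATED = "2025-12-05 02:04:43"


def age_in_days(last_updated, now):
    """Whole days elapsed between the recorded update time and `now`."""
    updated = datetime.strptime(last_updated, "%Y-%m-%d %H:%M:%S")
    updated = updated.replace(tzinfo=timezone.utc)
    return (now - updated).days


# Illustrative reference date; in practice use datetime.now(timezone.utc).
reference = datetime(2026, 1, 4, 2, 4, 43, tzinfo=timezone.utc)
print(age_in_days(LAST_UPDATED, reference))
```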