A dataset containing 15,000 audio samples of a male Dutch Flemish voice. It was created by fibleep and ported from the dutch-vl-tts GitHub repository to the Hugging Face platform. The data was last updated on April 16, 2024, and originates from the Mozilla Common Voice project's Dutch language data.
Use Cases
- Train a text-to-speech model based on the 15,000 male Flemish voice samples.
- Fine-tune a speech synthesis model for the Dutch Flemish dialect based on the described audio data.
- Benchmark or augment existing speech datasets with Flemish audio based on the described source.
Strengths
- Contains 15,000 audio samples, providing a substantial base for model training.
- Sourced from the Mozilla Common Voice project, a known open-source speech data initiative.
Limitations
- Description metadata is limited; actual data quality requires manual inspection after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
Provenance
- Source
- Mozilla Common Voice project (Dutch language subset).
- Collection Method
- Extracted from the Common Voice server data and ported from a GitHub repository.
- Time Range
- null
- Freshness
- Last updated 2024-04-16 14:52:25; freshness should be verified.
- Geography
- Likely focused on Flemish (Dutch-speaking Belgium) regions.