Kaggle hosts the WAVLM_Gender(VF) dataset. The title suggests it contains speech audio data likely intended for gender classification tasks. Specific details on volume, creator, and creation date are unavailable.
Use Cases
- Training a gender classifier from speech audio features (inferred from domain, verify after download)
- Benchmarking speech representation models on speaker attribute tasks (inferred from domain, verify after download)
- Analyzing acoustic correlates of perceived gender in voice (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform with established data sharing infrastructure.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.