A curated subset of the MTG-Jamendo Autotagging benchmark containing tracks annotated with genre, instrument, and mood/theme tags. Audio files are preprocessed to 30-second clips at a 16kHz sampling rate for consistent music auto-tagging tasks. The dataset was uploaded by author vtsouval and last updated on 2025-05-14.
Use Cases
- Train multi-label audio classifiers based on the described genre, instrument, and mood/theme annotations.
- Benchmark music information retrieval systems based on the preprocessed 30-second audio clips.
- Develop feature extraction models for music similarity or recommendation based on the three tag types.
Strengths
- Preprocessed audio ensures consistent 30-second length and 16kHz sampling rate for model training.
- Dataset is curated to include only tracks with all three annotation types (genre, instrument, mood/theme).
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
Provenance
- Source
- MTG-Jamendo Autotagging benchmark
- Collection Method
- Curated selection of tracks from the source benchmark.
- Freshness
- Last updated 2025-05-14 13:43:55; freshness should be verified.