500,000 user profiles containing top artists, tracks, albums, and playcounts. The dataset includes rankings, user countries, and MusicBrainz IDs where available, created by GabeKahen and last updated on April 28, 2026. It is designed for modeling music taste and analyzing listening behavior.
Use Cases
- Building music recommender systems based on user-level top artists and tracks.
- Training collaborative filtering models based on playcounts and rankings.
- Modeling music taste and personalization based on aggregated listening history.
- Analyzing user behavior and music consumption patterns based on country data.
Strengths
- Large-scale dataset covering approximately 500,000 users.
- Includes multiple data facets: top artists, tracks, albums, playcounts, rankings, and user countries.
- Integrates MusicBrainz IDs for standardized artist and track identification where available.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count and file size are unknown, which may limit suitability assessment.
- Data may reflect geographic or platform-specific bias inherent to its source.
Provenance
- Source
- huggingface
- Freshness
- Last updated 2026-04-28 18:05:11; freshness should be verified.