Personality-traits-of-Twitter-users-(celebrities) is a dataset from OpenML for finding similarities between public figures based on their Twitter activity. It likely contains personality scores for users across five traits, along with word counts and professional categories. The dataset is published under a CC0 1.0 license.
Use Cases
- Cluster celebrities into personality-based groups based on the five trait scores.
- Analyze correlations between a user's professional category and their personality traits.
- Explore potential relationships between Twitter word count and personality trait scores.
Strengths
- Includes scores for five established personality traits (Openness, Conscientiousness, Extraversion, Agreeableness, Neuroticism).
- Contains a professional category label for each user, such as actor or singer.
- Published under a permissive CC0 1.0 public domain license.
Limitations
- Row count, file formats, and column-level documentation are unknown, limiting suitability assessment.
- Last update date and data freshness are unverified.
- The data collection and scoring methodology are not described, which may affect reproducibility.
Provenance
- Source
- OpenML platform.
- Collection Method
- Likely derived from analyzing Twitter activity of public figures, but the specific methodology is not described.
- Time Range
- null
- Freshness
- Last updated date is unknown; freshness unverified.
- Geography
- null