A collection of tweets about an airline company, manually classified for sentiment. The dataset contains tweet IDs, text, language, and sentiment labels. It was uploaded to OpenML under a permissive CC0 license.
Use Cases
- Train a sentiment classifier for airline-related tweets based on the manually labeled sentiment values.
- Analyze common themes in positive or negative customer feedback based on the tweet text.
- Benchmark NLP model performance on a domain-specific text classification task.
Strengths
- Sentiment labels are manually annotated, which may provide higher quality than automated methods.
- All tweets are in English, simplifying preprocessing for monolingual models.
Limitations
- The row count and dataset scale are unknown, which may limit suitability assessment.
- Column-level documentation beyond the basic description is absent; field semantics must be fully inferred after download.
- The last update date is unknown; freshness is unverified.
Provenance
- Source
- OpenML
- Collection Method
- Tweets were collected and manually classified by the uploader.
- Time Range
- null
- Freshness
- null
- Geography
- null