Description
A collection of posts scraped from Reddit subreddits. According to the dataset description, each post includes a title, author, score, comments, flair, and text content. The dataset's author, size, and last update date are unknown.
Use Cases
Train language models on informal text, using the post text field named in the description.
Analyze community engagement patterns, using the score and comments fields named in the description.
Study user flair distribution across subreddits, using the flair field named in the description.
Conduct sentiment analysis on post titles and text, both of which the description lists as fields.
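As a rough illustration of the engagement and flair use cases, the sketch below counts posts per flair and averages the score. It assumes the data has already been loaded as a list of dictionaries with keys named `flair` and `score`. Those key names, the helper functions, and the sample rows are all hypothetical, since the actual column names are undocumented.

```python
from collections import Counter
from statistics import mean


def flair_distribution(rows, flair_key="flair"):
    """Count posts per flair; missing or empty flair is grouped under '(none)'."""
    return Counter((row.get(flair_key) or "(none)") for row in rows)


def mean_score(rows, score_key="score"):
    """Average the score over rows whose score parses as an integer."""
    scores = []
    for row in rows:
        try:
            scores.append(int(row.get(score_key, "")))
        except (TypeError, ValueError):
            continue  # skip rows with a missing or non-numeric score
    return mean(scores) if scores else None


# Hypothetical sample rows; the real column names may differ.
sample_rows = [
    {"title": "First post", "score": "10", "flair": "Discussion"},
    {"title": "Second post", "score": "4", "flair": ""},
    {"title": "Third post", "score": "n/a", "flair": "Discussion"},
]

print(flair_distribution(sample_rows))
print(mean_score(sample_rows))
```

The key names are passed as parameters so they can be adjusted once the real schema is known.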
Strengths
The description specifies several key data fields likely to be present, including title, author, score, comments, flair, and text.
The scraper tool appears designed to collect data from any subreddit, suggesting potential for broad topical coverage.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Row count is unknown, which makes it hard to judge suitability before download.
Column-level documentation is absent; field semantics must be inferred after download.
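Because the columns are undocumented, a first inspection pass after download might look like the sketch below. It assumes the data is available as CSV, which the listing does not confirm. The `summarize_columns` helper and the inline sample are hypothetical stand-ins for the real file.

```python
import csv
import io


def summarize_columns(rows):
    """Return {column: (non_empty_count, first_non_empty_value)} for dict rows."""
    summary = {}
    for row in rows:
        for column, value in row.items():
            count, example = summary.get(column, (0, None))
            if value not in (None, ""):
                count += 1
                if example is None:
                    example = value
            summary[column] = (count, example)
    return summary


# Hypothetical CSV content standing in for the downloaded file.
sample_csv = io.StringIO(
    "title,author,score,flair\n"
    "Hello,user_a,12,News\n"
    "World,,3,\n"
)

rows = list(csv.DictReader(sample_csv))
column_summary = summarize_columns(rows)
for column, (count, example) in column_summary.items():
    print(f"{column}: {count}/{len(rows)} non-empty, e.g. {example!r}")
```

For a real file, the `io.StringIO` object would be replaced with an opened file handle. The per-column fill rate and example value give a starting point for inferring field semantics.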
Provenance
Source
Reddit.com
Collection Method
Web scraping tool
License is unknown; users must verify compliance with Reddit's terms of service.