Sign in to view source links and access this dataset
Description
Real-time news articles dataset collected using NewsAPI and Python. The dataset contains 100 articles, though the specific sources, time range, and collection methodology are not detailed. It was posted on Kaggle, but the author, organization, and license are unknown.
Use Cases
Train a text classification model based on article content.
Perform sentiment analysis on real-time news headlines.
Analyze topic trends over time based on the news corpus.
Build a news summarization tool based on the article text.
Strengths
Data is described as 'real-time', suggesting potential recency.
Collection method is specified as using NewsAPI and Python.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Provenance
Source
NewsAPI
Collection Method
Collected using Python scripts.
License is unknown; users should verify terms of use.