Merged Fake News: Combined Datasets from PubHealth, LIAR2, and MultiFC
Available on 1 platform
Sign in to view source links and access this dataset
Description
A merged collection of fake news datasets sourced from PubHealth, LIAR2, and MultiFC. The dataset is hosted on Kaggle, but its specific size, structure, and creation details are not provided in the available metadata. The content likely contains text articles or statements with associated veracity labels.
Use Cases
Train a binary classifier to detect fake news articles (inferred from domain, verify after download)
Benchmark multi-class models for claim verification across different source datasets (inferred from domain, verify after download)
Analyze linguistic patterns and features common to misinformation (inferred from domain, verify after download)
Strengths
Published on Kaggle, a platform with established data-sharing infrastructure.
Combines multiple known source datasets (PubHealth, LIAR2, MultiFC), suggesting a potentially broad scope.
Limitations
Metadata is minimal; actual content requires verification after download.
Row count, column definitions, and file formats are unknown, which limits suitability assessment.
License, author, and last update information are absent.
Provenance
Source
Kaggle
Collection Method
Merging of existing datasets (PubHealth, LIAR2, MultiFC); specific methodology unknown.
Time Range
Temporal coverage of the source datasets is unknown.
Freshness
Last updated date is unknown; freshness unverified.
Geography
Spatial coverage of the source datasets is unknown.
License restrictions are unknown; users must verify terms before use.