Merged Fake News: Combined Datasets from PubHealth, LIAR2, and MultiFC

Description

This dataset merges three fake news and fact-checking datasets: PubHealth, LIAR2, and MultiFC. It is hosted on Kaggle, but the available metadata does not state its size, structure, or how it was created. It likely contains claims or article text paired with veracity labels; confirm this after download.

Use Cases

The following uses are inferred from the dataset's domain; verify each against the actual contents after download.
Train a binary classifier to detect fake news articles
Benchmark multi-class claim verification models across the different source datasets
Analyze linguistic patterns and features common to misinformation
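
For the binary classification use case, multi-way verdicts from the source datasets would first need collapsing into two classes. The following is a minimal sketch, assuming the merged file stores verdicts as strings. The label sets and the `to_binary` helper are hypothetical; the actual label vocabulary is unknown and must be read from the downloaded file.

```python
from typing import Optional

# Hypothetical label vocabulary: the merged file's real labels are unknown
# and must be checked after download. This split is one illustrative choice.
FAKE_LABELS = {"false", "pants-fire"}
REAL_LABELS = {"true", "mostly-true"}


def to_binary(label: str) -> Optional[int]:
    """Map a verdict string to 1 (fake), 0 (real), or None (ambiguous or unknown)."""
    # Normalize case, surrounding whitespace, and separators before lookup.
    key = label.strip().lower().replace(" ", "-").replace("_", "-")
    if key in FAKE_LABELS:
        return 1
    if key in REAL_LABELS:
        return 0
    # Intermediate or unrecognized verdicts are left out rather than guessed.
    return None
```

Rows that map to `None` can be dropped for a binary task or kept for a multi-class benchmark. Where to place intermediate verdicts is a modeling decision, not something the metadata settles.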

Strengths

Published on Kaggle, a platform with established data-sharing infrastructure.
Combines three established source datasets (PubHealth, LIAR2, MultiFC), so it likely covers more topics and sources than any one of them alone.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, column definitions, and file formats are unknown, which limits suitability assessment.
License, author, and last update information are absent.
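
Because row counts and column definitions are unknown, a first step after download is to inspect each file. The following is a minimal sketch using only the Python standard library, assuming the download contains UTF-8 CSV files; the file format is not confirmed by the metadata, and the `summarize_csv` and `summarize_directory` helpers are hypothetical names.

```python
import csv
from pathlib import Path


def summarize_csv(path, sample_rows=3):
    """Return column names, row count, and a few sample rows for one CSV file."""
    with open(path, newline="", encoding="utf-8") as handle:
        reader = csv.DictReader(handle)
        # Reading fieldnames consumes the header row.
        columns = list(reader.fieldnames or [])
        sample = []
        row_count = 0
        for row in reader:
            if row_count < sample_rows:
                sample.append(row)
            row_count += 1
    return {"columns": columns, "rows": row_count, "sample": sample}


def summarize_directory(directory):
    """Summarize every CSV file found directly inside a directory."""
    return {
        path.name: summarize_csv(path)
        for path in sorted(Path(directory).glob("*.csv"))
    }
```

Calling `summarize_directory` on the extracted download folder shows which columns hold the text and the labels, which is enough to judge suitability before building a full pipeline.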

Provenance

Source: Kaggle
Collection Method: Merging of existing datasets (PubHealth, LIAR2, MultiFC); specific methodology unknown.
Time Range: Temporal coverage of the source datasets is unknown.
Freshness: Last updated date is unknown; freshness unverified.
Geography: Spatial coverage of the source datasets is unknown.

License restrictions are unknown; users must verify terms before use.

Related Topics

Text, Media Analysis, Fake News, Text Classification, Natural Language Processing

Quality Score

D16

Description: 8
Source: 17
Reputation: 18
Access: 31

Community

0 views

Dataset Info

Last synced: May 19, 2026
