A multilingual text dataset for fake news detection, containing content in both Bengali and English. It is hosted on Kaggle, but the author, organization, and creation details are unspecified. The dataset's size, specific contents, and collection methodology are not described in the available metadata.
Use Cases
- Training a binary classifier to identify fake news articles (inferred from domain, verify after download)
- Benchmarking cross-lingual transfer learning models for misinformation detection (inferred from domain, verify after download)
- Analyzing linguistic features of misinformation across different languages (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform with established data sharing and versioning tools.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count and file formats are unknown, which may limit suitability assessment.