HinFakeNews is a dataset focused on fake news detection in the Hindi language. The dataset is hosted on Kaggle, but specific details about its size, creation date, and authorship are not provided in the available metadata. Its content likely contains text samples labeled as real or fake news for model training.
Use Cases
- Train a binary classifier to detect fake news in Hindi text (inferred from domain, verify after download)
- Benchmark multilingual NLP models on a low-resource language task (inferred from domain, verify after download)
- Analyze linguistic features and patterns in Hindi misinformation (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform with established data sharing and versioning tools.
- Focuses on a specific, high-impact domain (fake news) and language (Hindi).
Limitations
- Metadata is minimal; actual content, size, and labeling quality require verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Last update date is unknown; freshness unverified.
Provenance
- Source
- Kaggle
- Collection Method
- null
- Time Range
- null
- Freshness
- null
- Geography
- null