A collection of emails likely related to phishing attacks, sourced from Kaggle. The dataset's specific size, origin, and temporal coverage are unknown. It is intended for analysis of deceptive email content.
Use Cases
- Train a classifier to distinguish phishing emails from legitimate ones (inferred from domain, verify after download)
- Analyze linguistic patterns and keywords common in phishing attempts (inferred from domain, verify after download)
- Benchmark phishing detection algorithms (inferred from domain, verify after download)
Limitations
- Metadata is minimal; actual content requires verification after download
- Row count is unknown, which may limit suitability assessment
- Column-level documentation is absent; field semantics must be inferred after download