Synthetic phishing dataset is hosted on Kaggle. The dataset likely contains simulated data for phishing detection tasks. Metadata is minimal; specifics about size, columns, and provenance are unknown.
Use Cases
- Train a classifier to identify phishing emails or websites (inferred from domain, verify after download)
- Benchmark anomaly detection algorithms on synthetic security data (inferred from domain, verify after download)
- Generate synthetic phishing examples for adversarial training (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a popular platform for data science resources.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.