A Kaggle dataset for detecting phishing websites. It likely contains features extracted from website URLs and page content, but the provided metadata does not state its size, origin, or update history.
Use Cases
- Train a binary classifier to distinguish phishing from legitimate websites (inferred from domain, verify after download)
- Feature engineering for URL and web content security analysis (inferred from domain, verify after download)
- Benchmarking machine learning models for cybersecurity applications (inferred from domain, verify after download)
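As an illustration of the URL feature-engineering use case, here is a minimal sketch using only the Python standard library. The function name `url_features` and the six features it computes are assumptions chosen for illustration. They are not confirmed to match any column in this dataset, so compare them against the actual fields after download.

```python
import ipaddress
from urllib.parse import urlparse


def url_features(url: str) -> dict:
    """Compute a few illustrative lexical features from a URL string.

    Hypothetical feature set: not taken from the dataset's documentation.
    """
    parsed = urlparse(url)
    host = parsed.hostname or ""

    # A host that parses as an IPv4/IPv6 literal is a signal often checked
    # in URL analysis; ip_address raises ValueError for anything else.
    try:
        ipaddress.ip_address(host)
        host_is_ip = True
    except ValueError:
        host_is_ip = False

    return {
        "url_length": len(url),
        "num_dots_in_host": host.count("."),
        "num_hyphens_in_host": host.count("-"),
        "has_at_symbol": "@" in url,
        "uses_https": parsed.scheme == "https",
        "host_is_ip": host_is_ip,
    }
```

If the downloaded data includes raw URLs, a function like this could be applied per row to derive extra columns. If it ships only precomputed features, the sketch is mainly useful for checking what those features might mean.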
Strengths
- Published on Kaggle, a platform with an active community for data sharing and discussion.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, so whether the dataset is large enough for a given task cannot be judged before download.
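The limitations above all come down to inspecting the file after download. The following sketch shows one way to do that with the standard library, assuming the dataset is distributed as a CSV file with a header row (not confirmed by the metadata). The function name `summarize_csv` and the file name in the usage comment are hypothetical.

```python
import csv
from typing import Iterable


def summarize_csv(lines: Iterable[str], max_distinct: int = 10) -> dict:
    """Report row count, column names, and columns with exactly two values.

    `lines` can be an open text file or any iterable of CSV lines.
    Two-valued columns are only *candidates* for a phishing/legitimate
    label; their meaning still has to be checked by hand.
    """
    reader = csv.DictReader(lines)
    columns = list(reader.fieldnames or [])
    distinct = {name: set() for name in columns}
    row_count = 0

    for row in reader:
        row_count += 1
        for name in columns:
            values = distinct[name]
            # Stop collecting once a column clearly has many distinct values.
            if len(values) <= max_distinct:
                values.add(row[name])

    binary_columns = [name for name in columns if len(distinct[name]) == 2]
    return {
        "row_count": row_count,
        "columns": columns,
        "binary_columns": binary_columns,
    }


# Usage (hypothetical file name; substitute the real one after download):
# with open("phishing.csv", newline="") as handle:
#     print(summarize_csv(handle))
```

The summary addresses the unknown row count and missing column documentation directly: the column list shows what needs to be interpreted, and the two-valued columns narrow down where a class label might be.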