A dataset for detecting phishing URLs, published on Kaggle. The specific number of records, features, and collection methodology are not detailed in the available metadata. Further details about the dataset's origin, size, and structure require verification after download.
Use Cases
- Train a binary classifier to distinguish phishing URLs from legitimate ones (inferred from domain, verify after download)
- Benchmark low-latency detection algorithms for web security applications (inferred from domain, verify after download)
- Analyze features common to malicious URLs for threat intelligence (inferred from domain, verify after download)
Strengths
- Published on Kaggle, a platform with an established data science community.
Limitations
- Metadata is minimal; actual content requires verification after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count and data scale are unknown, which may limit suitability assessment.