Kaggle hosts a dataset of URLs labeled as phishing or safe. The dataset likely contains a large collection of web addresses for security analysis. The title suggests it pertains to data from 2026.
Use Cases
- Training a binary classifier to distinguish phishing URLs from safe ones (inferred from domain, verify after download)
- Feature engineering for URL-based threat intelligence systems (inferred from domain, verify after download)
- Benchmarking phishing detection algorithms on a large-scale dataset (inferred from domain, verify after download)
Limitations
- Metadata is minimal; actual content requires verification after download
- Column-level documentation is absent; field semantics must be inferred after download