Phishing and Spam Email Dataset for Security Analysis
Available on 1 platform
Sign in to view source links and access this dataset
Description
Kaggle hosts a dataset focused on phishing and spam content. The dataset's specific size, features, and collection method are not detailed in the provided metadata. Its author, organization, and last update date are currently unknown.
Use Cases
Train a binary classifier to distinguish phishing emails from legitimate ones (inferred from domain, verify after download)
Analyze textual patterns and features common to spam messages (inferred from domain, verify after download)
Benchmark the performance of new anomaly detection algorithms for email security (inferred from domain, verify after download)
Strengths
Published on Kaggle, a platform with established data sharing and versioning practices.
Limitations
Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.
Provenance
Source
Kaggle
Collection Method
Unknown
Time Range
Unknown
Freshness
Last update date is unknown; freshness unverified.
Geography
Unknown
License is unknown; users must verify terms of use before applying the dataset.