DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Software Engineering & Security

Zero-Day Phishing: Labeled Post-Rendered HTML Pages

Available on 1 platform

Description

Labeled HTML pages, each categorized as benign or phishing, intended for training machine learning models for cybersecurity. The dataset is hosted on Kaggle; its author, organization, and creation date are unspecified, and the total number of pages, the file formats, and the specific features are not documented.

Use Cases

Train binary classifiers to distinguish phishing from benign websites based on HTML content.
Develop feature extraction pipelines for post-rendered web page analysis.
Benchmark model performance on a labeled corpus of HTML pages for cybersecurity applications.
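
As an illustration of the feature extraction use case, here is a minimal sketch using only the Python standard library. The listing does not document the dataset's file layout, so the function takes a raw HTML string. The feature names, the `page_host` parameter, and the sample page are assumptions made for this example, not fields of the dataset.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class PageFeatureParser(HTMLParser):
    """Counts a few structural signals in one HTML document."""

    def __init__(self, page_host=""):
        super().__init__()
        self.page_host = page_host  # hypothetical: host the page was served from
        self.features = {
            "forms": 0,
            "password_inputs": 0,
            "links": 0,
            "external_links": 0,
            "scripts": 0,
        }

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag and attribute names; values may be None.
        attrs = dict(attrs)
        if tag == "form":
            self.features["forms"] += 1
        elif tag == "input" and (attrs.get("type") or "").lower() == "password":
            self.features["password_inputs"] += 1
        elif tag == "script":
            self.features["scripts"] += 1
        elif tag == "a" and attrs.get("href"):
            self.features["links"] += 1
            host = urlparse(attrs["href"]).netloc
            # Relative links have an empty host and count as internal.
            if host and host != self.page_host:
                self.features["external_links"] += 1


def extract_features(html, page_host=""):
    """Return a dict of simple counts for one HTML page."""
    parser = PageFeatureParser(page_host)
    parser.feed(html)
    parser.close()
    return parser.features


# Made-up sample page, for illustration only.
sample = (
    '<html><body><form action="https://login.example/submit">'
    '<input type="text" name="user"><input type="password" name="pw"></form>'
    '<a href="https://other.example/help">Help</a><a href="/home">Home</a>'
    "</body></html>"
)
print(extract_features(sample, page_host="bank.example"))
```

Counts like these are only a starting point; which signals actually separate the two classes would have to be checked against the data after download.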

Strengths

Data is explicitly labeled for a binary classification task (benign vs. phishing).
Focuses on post-rendered HTML, which likely captures the final content a user sees.
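
Because the labels are binary, a classifier trained on this data can be scored with standard precision, recall, and F1. The sketch below treats phishing as the positive class. The label strings `"phishing"` and `"benign"` are assumptions taken from the description; the actual label encoding is not documented and must be checked after download.

```python
def binary_metrics(y_true, y_pred, positive="phishing"):
    """Compute accuracy, precision, recall, and F1 for one positive class."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(t == positive and p == positive for t, p in pairs)
    fp = sum(t != positive and p == positive for t, p in pairs)
    fn = sum(t == positive and p != positive for t, p in pairs)
    tn = sum(t != positive and p != positive for t, p in pairs)

    # Guard against empty denominators so the function never divides by zero.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    accuracy = (tp + tn) / len(pairs) if pairs else 0.0
    return {
        "accuracy": accuracy,
        "precision": precision,
        "recall": recall,
        "f1": f1,
    }
```

Since the row count and class balance are unknown, accuracy alone may be misleading; recall on the phishing class is usually the more informative number for this kind of task.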

Limitations

Row count is unknown, so the dataset's size and suitability for a given task cannot be assessed before download.
Column-level documentation is absent; field semantics must be inferred after download.
Last update date is unknown; freshness unverified.
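
Given these gaps, a first step after download is to take stock of what the archive actually contains. The following is a minimal sketch that assumes the dataset has already been extracted to a local directory; it makes no assumption about the layout and simply reports file counts, extensions, and top-level folders.

```python
from collections import Counter
from pathlib import Path


def inventory(root):
    """Summarize file count, total size, and extensions under root."""
    root = Path(root)
    files = [p for p in root.rglob("*") if p.is_file()]
    return {
        "file_count": len(files),
        "total_bytes": sum(p.stat().st_size for p in files),
        # Files without a suffix are grouped under "(none)".
        "extensions": Counter(p.suffix.lower() or "(none)" for p in files),
        "top_level_dirs": sorted(p.name for p in root.iterdir() if p.is_dir()),
    }
```

Calling `inventory("path/to/extracted/dataset")` (a placeholder path) gives a quick answer to the row count and file format questions above. Whether labels live in folder names, a separate file, or somewhere else still has to be determined by inspection.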

Provenance

Source: Kaggle

License is unknown; users must verify permissions before use.

Related Topics

Text, Machine Learning, Cybersecurity, HTML Analysis, Phishing Detection

Quality Score

D21

Description

Source

Reputation

Access

Community

0 views

Dataset Info

Last synced: Apr 25, 2026
