Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,591 datasets
Plans of Action and Milestones (POA&M) are corrective action plans required by the Department of Homeland Security for tracking and resolving information security weaknesses. The dataset contains these plans as assigned to agencies for remediation, with the last update recorded in January 2026. Specific details on the number of plans, rows, or columns are not provided in the input.
A dataset titled 'Malware' is hosted on Kaggle. The dataset's specific content, size, and origin are not detailed in the provided metadata. Its columns, sample data, and other descriptive attributes are currently unknown.
Malware samples acquired from the Malware Bazaar platform to create a dataset for detection tasks. The dataset's author, organization, and specific temporal coverage are not provided. The data is hosted on Kaggle and is tagged for cyber security applications.
4,898,431 connection records categorized into 23 attack types and a 'normal' class. The data includes 41 features per connection, such as protocol_type, service, and src_bytes, derived from raw TCP dump data.
GitHub pull request data collected from 24 software repositories. The dataset likely contains information related to code review and collaboration processes on the platform. It is hosted on Kaggle, but specific details about its creation, size, and structure are not provided in the metadata.
A dataset from Kaggle titled 'Commit Sklearn Lib'. The title suggests it contains version control data, likely commit histories, related to the scikit-learn machine learning library. The dataset's specific content, size, and origin are not detailed in the provided metadata.
A collection of image files likely representing malware and benign software samples. The dataset is hosted on Kaggle, but details on the number of images, creation date, and author are unknown. Columns and sample data are unavailable for inspection.
A dataset hosted on Kaggle, likely intended for fine-tuning machine learning models to detect phishing attempts. The title suggests it contains examples of phishing-related data, but specific content, size, and features are not detailed in the provided metadata. Further verification is required to confirm the dataset's structure and intended application.
A dataset titled 'phishing-email' is hosted on Kaggle. The dataset's content, size, and specific attributes are not described in the provided metadata. Its actual composition and scale require verification after download.
An email dataset focused on phishing content, sourced from Kaggle. The dataset likely contains email text and labels for phishing classification. Metadata is minimal; specifics about size, columns, and provenance are unknown.
Phishing dataset(email) is a collection of email data hosted on Kaggle, likely intended for cybersecurity research. The dataset's specific content, size, and origin are not detailed in the provided metadata. Users must download the dataset to verify its structure and suitability for their tasks.
Edge-IIoT Balanced Subset for Intrusion Detection likely contains data related to cybersecurity in Industrial Internet of Things environments. The dataset is hosted on Kaggle, but specific details about its size, creator, and update date are unavailable. Columns likely suggest network traffic or system event logs.
Synthetic phishing dataset is hosted on Kaggle. The dataset likely contains simulated data for phishing detection tasks. Metadata is minimal; specifics about size, columns, and provenance are unknown.
A dataset titled 'phishing_url' is hosted on Kaggle. The dataset likely contains URLs labeled as legitimate or phishing for security analysis. Metadata such as column details, size, and license are currently unknown.
A merged collection of email data from three sources: Enron, Nazario, and SpamAssassin. The dataset likely contains emails labeled as phishing or spam, intended for security research. It is hosted on Kaggle, but specific details about its size, structure, and creation date are unknown.
Packet capture (PCAP) files likely containing network traffic data from Distributed Denial of Service (DDoS) attacks. The dataset is hosted on Kaggle, but details on its size, collection method, and time range are not provided in the metadata. The author, organization, and specific license are also unknown.
A cybersecurity dataset published on Kaggle. The title suggests it may contain network or system security data, potentially related to intrusion detection or threat analysis. The dataset's specific contents, size, and origin require verification after download.
A merged collection of emails from three established sources: the Enron corpus, the Nazario phishing corpus, and the SpamAssassin public corpus. The dataset is hosted on Kaggle, but specific details like row count, file formats, and license are not provided in the metadata. Its content likely contains a mix of legitimate, spam, and phishing emails for analysis.
A collection of emails likely related to phishing attacks, sourced from Kaggle. The dataset's specific size, origin, and temporal coverage are unknown. It is intended for analysis of deceptive email content.
A dataset for detecting phishing URLs, published on Kaggle. The specific number of records, features, and collection methodology are not detailed in the available metadata. Further details about the dataset's origin, size, and structure require verification after download.