Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,591 datasets
A collection of synthetic phishing emails, likely for training and evaluating detection models. The dataset is hosted on Kaggle and its columns suggest it contains text content for classification tasks. Specific details on volume, creation date, and authorship are not provided in the available metadata.
Kaggle hosts this dataset related to phishing, a common cybersecurity threat. The dataset's specific content, size, and origin are not detailed in the available metadata. Its structure and features must be verified after download.
Malware samples represented as image files, likely generated by converting raw binary bytes into PNG format. The dataset is hosted on Kaggle and appears to be designed for classification tasks in cybersecurity. Specifics on the number of samples, source, and creation date are not provided in the available metadata.
A dataset for MLOps in network security, sourced from Kaggle. The dataset likely contains features for network intrusion detection system (NIDS) modeling. Specific details on volume, origin, and update history are not provided in the available metadata.
Release 4.2 of the CERT Insider Threat Dataset provides synthetic logs simulating malicious insider activity. The dataset is designed for cybersecurity research, created by the CERT Coordination Center. Its specific temporal coverage is not detailed in the provided input.
A dataset compiled from a systematic literature review (SLR) on cybersecurity and antivirus topics. The dataset is hosted on Kaggle, but the author, organization, and specific temporal coverage are unknown. Its content likely includes structured information extracted from academic papers, such as findings, methodologies, or metadata.
Phishing datasets likely contain examples of malicious and legitimate communications for security analysis. The dataset is hosted on Kaggle, but details on its size, origin, and specific features are not provided in the metadata. The author, organization, and last update date are unknown.
Test images for the MS Malware Classification Big2025 challenge, hosted on Kaggle. The dataset likely contains malware sample images for classification tasks. Metadata is minimal; the exact number of images, file formats, and specific labels require verification after download.
Supplying the replication data and code for the 2025 Report by the AEA Oversight Committee for the Registry of Randomized Controlled Trials. It contains all materials necessary to regenerate the tables, graphs, and figures presented in the official report. The author is Jack Cavanagh, and the data was last updated in January 2026.
Malware analysis datasets containing sequences of API calls. The dataset is hosted on Kaggle, but specific details on the number of samples, collection period, and original authors are not provided in the available metadata. The content likely consists of behavioral logs from malware execution.
Openvul Cwe Hierarchical Mapping provides parent-child relationship data for all Common Weakness Enumerations (CWEs) within the CWE-1000 Research view. Created by Leopo1d and updated in February 2026, the dataset contains fewer than 1,000 records to facilitate prediction-level CWE matching in security contexts.
Comprising MATLAB files used to generate data for the research paper 'Randomization Times under Quantum Chaotic Hamiltonian Evolution' (arXiv:2512.25074). The code was authored by Joaquin Rodriguez Nieva and was last updated in February 2026.
Cybersecurity Intel - May 2026 (Free Sample) is a dataset containing 824 Common Vulnerabilities and Exposures (CVEs) and 1026 advisories. The data covers 11 software ecosystems and is a sample from May 2026. It was sourced from Kaggle, but the original author and collection methodology are not specified.
Ransomware eBPF I/O Dataset likely contains system-level I/O traces captured using eBPF technology. The dataset is hosted on Kaggle, but its specific contents, size, and creation details are not provided in the metadata. Columns, sample data, and authorship are currently unknown.
A dataset from the Microsoft Malware Classification Challenge, likely containing features for classifying malware families. The data was originally published on Kaggle, a platform for data science competitions. Specific details on the number of samples, features, and collection date are not provided in the available metadata.
MATLAB code repository for the journal article 'Winding-based Point-Inclusion Tests for Spherical Polygons'. It consists of M-files and MAT-files, including a test script, and is available under the MIT License.
A balanced subset of data for malware classification tasks, sourced from Kaggle. The dataset's specific size, features, and origin are not detailed in the provided metadata. Further verification after download is required to confirm the exact content and structure.
Bug Report Fix is a dataset published on HuggingFace by user buttersx. The title suggests it likely contains records related to software bug reports and their resolutions. The dataset was last updated on February 18, 2026.
A collection of malware images, likely created by converting executable files into visual representations for analysis. The dataset is hosted on Kaggle, but specifics regarding the number of images, source, and creation date are not provided in the available metadata. Further details about the image format, labeling, and collection methodology require verification after download.
A cleaned dataset from Kaggle, likely containing network traffic features related to Distributed Denial of Service (DDoS) attacks. The dataset's origin, size, and specific time range are not provided in the metadata. Its content and structure must be verified after download.