Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,591 datasets
MalwareBazaar Malware Files PDF DOCX XLSX contains real malware samples sourced from the MalwareBazaar platform for security research. The dataset includes files in common document formats such as PDF, DOCX, XLSX, and PPTX. Specific details on the number of samples, collection date, and original author are not provided in the metadata.
A dataset titled 'phishing_ray' is hosted on Kaggle. The dataset's title suggests it likely contains information related to phishing attacks or network security threats. Metadata such as columns, size, license, and update history are unknown and require verification.
CRASAR-U-DROIDs contains 265 orthomosaic images featuring 122,502 views of 21,716 building polygons for disaster response. Created by CRASAR, the collection focuses on building damage assessment and polygon alignment using small Unmanned Aerial Systems (sUAS).
A cybersecurity dataset designed for detecting phishing and homoglyph attacks. The dataset's creator, size, and specific temporal coverage are not provided. It originates from Kaggle.
Marius Guenzel authored this replication package for the study "Excess Commitment in R&D," hosted by the Review of Financial Studies Dataverse. The data supports econometric analysis of corporate research and development investment behaviors and was last updated in March 2026. It provides the necessary code and data to reproduce findings regarding financial decision-making in R&D departments.
MiniMed Prime source code, published on Kaggle, likely contains software artifacts related to a medical device system. The dataset's specific content, size, and provenance details are not provided in the minimal metadata. Users must inspect the actual files to determine the code's scope, language, and structure.
The strategic plan defines a set of interrelated priorities for U.S. government agencies conducting or sponsoring cybersecurity research and development. The priorities are organized into four thrusts: inducing change, developing scientific foundations, maximizing research impact, and accelerating transition to practice. The document originates from the Federal Cybersecurity Research and Development Program.
Intrusion Detection System Datasets is a collection of data for cybersecurity analysis, published on the Kaggle platform. The specific content, size, and features are not detailed in the available metadata. Further verification of the data's structure and purpose is required after download.
Software testing examples published on Kaggle. The dataset likely contains examples of test cases, scenarios, or results used in software development. Metadata is minimal; the specific content, size, and creation details require verification after download.
DDoS attack data published on Kaggle. The dataset likely contains network traffic records related to distributed denial-of-service attacks. Metadata is minimal; the specific content, size, and features require verification after download.
A dataset focused on phishing URLs and the human element in cybersecurity. The description emphasizes human vulnerability as a key security weakness. The dataset's author, size, and temporal details are not provided.
A balanced dataset of 6,200 Malay emails annotated for phishing detection. The dataset is specifically designed for training models to identify phishing attempts in the Malay language. The author and specific collection time range are not provided.
Kaggle hosts a dataset titled 'CCT Source Code'. The dataset's content, size, and specific origin are not detailed in the provided metadata. The title suggests it contains source code files, likely for analysis or educational purposes.
CCT Source Code is a dataset published on Kaggle. The title suggests it contains source code files, likely for analysis or benchmarking. The dataset's specific content, size, and origin are not detailed in the provided metadata.
CICDDOS2019 is a dataset for cybersecurity research, specifically focused on Distributed Denial of Service attacks. It was published on the Kaggle platform. The specific data volume, collection methodology, and time period are not detailed in the available metadata.
Kaggle hosts a dataset titled 'ransomware-ebpf-io-dataset-v478'. The dataset likely contains system call and I/O operation traces collected using eBPF technology, focusing on ransomware behavior. Its specific size, author, and last update date are unknown.
Created by smarie and last updated in March 2026, this collection provides Python code examples for implementing pytest suites. It focuses on the separation of test logic from test data using specific decorators and parametrization techniques.
World Development Indicators is an annual report series initiated in 1998 by the World Bank to track progress toward international development goals. The 2010 edition focuses on progress toward the Millennium Development Goals on the 10th anniversary of the declaration. The data likely contains quantitative metrics for tracking development targets and accountability.
Cybersecurity Cascade Trigger Node Detection is a dataset hosted on Kaggle. The dataset likely contains information related to identifying critical nodes or events within security event cascades. Metadata such as column descriptions, sample data, and size are unavailable, requiring verification after download.
Cybersecurity Cascade Trigger Node Detection 2 is a dataset hosted on Kaggle. Its title suggests it contains data related to detecting trigger nodes within cascading security events in networks. The specific contents, scale, and origin are not detailed in the provided metadata.