Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,564 datasets
The National Incident-Based Reporting System (NIBRS) offers perspective on the characteristics of the juvenile sex offender population coming to the attention of law enforcement. This dataset, authored by David Finkelhor, addresses a gap in population-based epidemiological information about youth who commit sexual offenses against minors. It likely contains incident-based records from law enforcement reports.
AYI-NEDJIMI's dataset provides 20 incident response playbooks and associated indicators of compromise for cybersecurity operations. The dataset is designed for digital forensic analysis and threat intelligence and was last updated on February 13, 2026. It includes step-by-step procedures for common incident types such as ransomware, phishing attacks, and data breaches.
This dataset supports research on the electoral consequences of mainstream party accommodation strategies toward niche parties, focusing on the issue of European integration. It examines election outcomes and dyadic vote switching across 14 Western European countries from 1988 to 2025. The data is used to test propositions regarding intra-party divisions and voter flows.
Featuring synthetic, tabular data for benign traffic and five attack types in a LAN-SDN environment. It has 36 features, including OpenFlow-dependent values, and is structured across six CSV files from different network topology experiments.
Synthetic data likely simulating Distributed Denial-of-Service (DDoS) attack scenarios. The dataset is hosted on Kaggle, but its creator, size, and specific contents are not detailed in the provided metadata. Its last update date is unknown.
rgeoda is an R library providing spatial data analysis functionalities based on the C++ source code of the open-source GeoDa software. The library offers tools for Exploratory Spatial Data Analysis, Spatial Cluster Detection, Clustering Analysis, and Regionalization. It was authored by Xun Li and is documented on the GeoDa Center website.
Atul M. Tonge's literature review paper synthesizes research on cyber security challenges for society. The paper focuses on emerging trends in mobile computing, cloud computing, e-commerce, and social networking. It also describes challenges due to lack of coordination between agencies and Critical IT Infrastructure.
7,222 Bitcoin seed addresses linked to 67 ransomware families, as identified by the expansion procedure described in the associated research paper. The dataset was compiled by Masarah Paquet-Clouston of CURE International UK and is hosted on the paperswithcode platform.
A dataset named 'datatest for committee' published on Kaggle. The raw description indicates it is a version for threshold testing. The author, organization, and specific details like size and license are unknown.
Commitments data from the International Bank for Reconstruction and Development (IBRD) records new loan amounts for public and publicly guaranteed projects. Figures are expressed in current U.S. dollars. The data is compiled by the International Debt Statistics: DSSI organization.
International Debt Statistics: DSSI provides data on new loan commitments from the International Development Association (IDA). The dataset tracks the sum of new commitments on public and publicly guaranteed loans, measured in current U.S. dollars. The organization responsible for the data is International Debt Statistics: DSSI.
Data on legally binding grant commitments, measured in current U.S. dollars, where no repayment is required. It is compiled by the International Debt Statistics: DSSI organization. The temporal coverage and specific data volume are not specified.
Grants disbursements from new commitments are tracked in current US dollars. The data originates from the Africa Development Indicators organization within the World Bank. The specific temporal coverage and volume of records are not detailed in the provided information.
Kaggle dataset titled PDNet-IDS26, which appears to combine features related to Parkinson's Disease with network data for intrusion detection systems. The description suggests a focus on the year 2026, but the specific data collection period is unknown. The dataset's author, organization, and exact size are not provided.
Labeled HTML pages categorized as benign or phishing for training cybersecurity machine learning models. The dataset is hosted on Kaggle, but the author, organization, and creation date are unspecified. The total number of pages, file formats, and specific features are unknown.
A 1973 report documents the open meeting of the JOIDES planning committee held in Zurich, September 26-28. The report discusses the future of the Deep Sea Drilling Project after 1975. It is a legacy publication from Geoscience Australia with no available abstract or structured data.
AYI-NEDJIMI's AI in Offensive and Defensive Cybersecurity - English Dataset is a bilingual collection covering the use of Artificial Intelligence in cybersecurity from both attacker and defender perspectives. The dataset synthesizes knowledge from articles on topics like LLM-based attack techniques and AI-augmented SIEM. It was last updated on February 13, 2026.
The replication package for a study on the relationship between military conflicts and state-building in pre-imperial China, authored by Joy Chen. It supports an incomplete contract model examining rulers' and local administrators' incentives during defensive and offensive wars. The data underpins empirical tests and historical case analyses published in the Journal of Economic History.
This is the replication package for a study on military conflicts and state-building in pre-imperial China, to be published in the Journal of Economic History. The dataset supports an incomplete contract model examining rulers' and local administrators' incentives during defensive and offensive wars.
UKCCSRC Call 2 Project data from the British Geological Survey focuses on developing multi-phase flow models for CO2 injection into depleted gas fields. The dataset supports the creation of Best Practice Guidelines for safe start-up injection procedures, aiming to validate models and propose optimum injection strategies.