Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,561 datasets
Step-by-step direction for preparing a coordinated response to cyber security incidents, published by the Communications Security Establishment Canada. The guidance is maintained as an official government document and was last updated in March 2026.
Communications Security Establishment Canada provides guidance on identifying phishing tactics in emails, texts, and websites. The resource, updated in March 2026, details common methods used to steal sensitive information.
A dataset for intrusion detection, published on Kaggle. The specific source, collection method, and volume of records are not detailed in the available metadata. The title suggests the data may be relevant for the year 2026.
State of Iowa provides address lists for other state and federal political action committees that file with the Iowa Ethics & Campaign Disclosure Board. The dataset was last updated on March 14, 2026. It is available in multiple formats including XML, RDF, JSON, and CSV.
Address lists for registered candidates and committees that file with the Iowa Ethics & Campaign Disclosure Board. The dataset is provided by the State of Iowa and was last updated on March 14, 2026. The data is available in multiple formats including XML, RDF, JSON, and CSV.
1,000 to 10,000 labeled text samples for detecting indirect prompt injection attacks within repository files, released by prodnull in March 2026. The data covers READMEs, CI/CD workflows, and documentation used as context for AI coding agents.
350 deduplicated OpenClaw agent skill files (SKILL.md) from February 2026, comprising 127 malicious and 223 benign samples. The dataset was created by yoonholee and last updated on Hugging Face on 2026-02-26. Malicious samples were extracted from real campaigns targeting ClawHub and use social engineering in markdown instructions to trick agents into running malware like AMOS (Atomic macOS Stealer).
Palisade Research released this collection of raw Cowrie honeypot logs in March 2026 to support cybersecurity research. The data captures network interactions and shell commands targeting custom LLM agents, accompanied by analysis scripts for processing the raw events.
Nava Ashraf of Harvard University Press designed a randomized control trial for a commitment savings product at a Philippine bank. The study involved a baseline survey of 1777 existing or former bank clients, with the product offered to a randomly selected subset of 710 individuals. After twelve months, average savings balances for the treatment group increased by 81 percentage points relative to the control group.
Healthcare IoMT Security Dataset for Intrusion Detection and ML Research focuses on network security for connected medical devices. Its description suggests it contains data for detecting malicious activity in Internet of Medical Things environments. The dataset's author, organization, and specific collection details are unknown.
CIC-DARKNET2020 is a dataset of darknet traffic designed for intrusion detection and deep learning research. It was sourced from Kaggle, but details about its author, organization, and creation date are unknown. The dataset's exact size, row count, and file formats are also unspecified.
Achilles Tatius' Leucippe and Clitophon is one of five surviving Greek novels from the Roman empire period, noted for its risqué and genre-stretching content. The narrative covers themes like adultery, violence, and improbable happy endings. This dataset likely contains the text of a new Oxford World's Classics translation, aiming to capture the original's exuberant variety.
~700,000 English SMS messages for binary classification of spam and smishing, created by notd5a and last updated in March 2026. The dataset was iteratively refined through multiple rounds of error analysis to improve data quality for cybersecurity applications.
2,899 real-world malware threats are categorized for SOC teams. The dataset likely contains labels for different malware families. Its provenance, including author, organization, and last update date, is unknown.
A legacy government report titled 'BMR marine program Report by a BMR committee on forward marine program'. The report was published by Geoscience Australia Data and is hosted on the data.gov.au platform. The content is described as a legacy product with no abstract available, and the last metadata update was recorded on 2026-03-25.
Static analysis data from PE section headers, extracted from Cuckoo Sandbox reports. The dataset contains examples of malware from VirusShare and goodware from portableapps.com and Windows 7 directories. It was created as part of PhD research on malware detection and classification using Deep Learning.
A daily updated collection of vulnerability management datasets. The data includes Common Vulnerabilities and Exposures (CVE) identifiers, CISA's Known Exploited Vulnerabilities (KEV) catalog, and Exploit Prediction Scoring System (EPSS) scores. The author and specific row counts are unknown.
International Development Association (IDA) concessional debt outstanding data tracks public and publicly guaranteed loans with a grant element of 35 percent or more. The dataset originates from the World Bank's International Debt Statistics: DSSI initiative. It measures debt service payments and the overall cost of borrowing in current U.S. dollars.
World Development Indicators data measures net official development assistance received as a percentage of central government expense. The dataset is compiled by the World Bank from reports by Development Assistance Committee members, multilateral institutions, and non-DAC countries. The specific temporal coverage and volume of records are not provided.
World Bank data on net Official Development Assistance (ODA) received, expressed as a percentage of a country's imports of goods, services, and primary income. The dataset is part of the World Development Indicators collection, compiled from reports by the Development Assistance Committee (DAC), multilateral institutions, and non-DAC countries. It measures concessional loans and grants intended to promote economic development and welfare in recipient nations.