Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,564 datasets
Joint Nature Conservation Committee (JNCC) conducted a marine survey of Stanton Banks from 17/Aug/2012 to 01/Sep/2012. The survey collected sea floor multibeam bathymetry data using a Kongsberg EM3002D system, alongside seabed imagery, grabs, and dredges.
A bilingual dataset for incident response, digital forensic analysis, and threat intelligence. It contains 20 step-by-step incident response playbooks covering common incident types, created by author AYI-NEDJIMI and last updated on February 13, 2026.
A dataset titled 'malwareSample' published on Kaggle. The dataset likely contains samples or features related to malicious software. Metadata is minimal; the specific content, size, and origin require verification after download.
Monthly-updated records of all financial payments exceeding £25,000 made by Registers of Scotland. The data is published by Registers of Scotland as part of the UK Government's commitment to expenditure transparency. The dataset was last updated on 2026-03-17.
A collection of 11,001 website URLs labeled for phishing detection. Each sample includes 15 website parameters and a binary class label, where 0 indicates a phishing URL and 1 indicates a legitimate one. The dataset is provided under a CC-BY-4.0 license on the OpenML platform.
Malware_benign_API is a dataset from Kaggle. It likely contains sequences of API calls made by software, distinguishing between malicious and benign programs. The specific number of samples, features, and collection methodology are unknown from the provided metadata.
SPIN-IDS Dataset full is a cybersecurity dataset published on Kaggle. The title suggests it contains data for network intrusion detection systems. The dataset's specific content, size, and provenance require verification after download.
Passports of budget programs detail the financial management plans of the Malyn City Executive Committee. The dataset is hosted on the States site of Ukraine and was last updated on February 19, 2026. It likely contains structured descriptions of budget allocations, objectives, and performance metrics.
Gaurav Sood provides an R interface to the VirusTotal API, a Google service for analyzing files and URLs for malware. The client supports API versions 2 and 3, offering features like file scanning, URL scanning, domain categorization, passive DNS information, and IP reputation analysis. It implements rate limiting, error handling, and response validation for security analysis workflows.
Source code files published on Kaggle under the identifier 'KLTN'. The dataset's specific purpose, size, and authorship are not detailed in the provided metadata. The content likely consists of programming language files for a software project or educational exercise.
SCV-1-2000 is a JSONL dataset containing 2000 entries focused on advanced and unconventional smart contract vulnerabilities, with an emphasis on Decentralized Finance (DeFi) protocols. It was created by author pug30 and last updated on February 12, 2026. The dataset is intended for training and evaluating models in smart contract security.
Milton S. Katz traces the development of the National Committee for a Sane Nuclear Policy (SANE). The work examines the organization's efforts for nuclear disarmament over a period from 1957 to 1985. The dataset is sourced from the paperswithcode platform.
Committee and bill introduction data for the lower chambers of select southern state legislatures. The dataset was authored by Michael Olson and is hosted on the Harvard Dataverse. It was last updated on March 18, -2026.
MalwareVis is a dataset published on Kaggle, focusing on malware analysis. The dataset likely contains features for visualizing or classifying malicious software behavior. Its specific contents, size, and creation details require verification after download.
TFM-Malware-Processed-Data is a Kaggle dataset likely containing processed features for malware analysis. The dataset's title suggests it may include engineered attributes derived from raw malware samples. Its author, organization, and specific collection details are not provided in the available metadata.
An annotated dataset of 3D CAD models for training reverse engineering models. It contains raw CAD data, rendered images from multiple viewpoints, and semantic part labels. The dataset includes train/test splits for three furniture categories, including over 2,000 annotated chairs.
A dataset of network traffic related to Distributed Denial-of-Service attacks, published on Kaggle. The dataset's specific size, features, and collection methodology are not detailed in the available metadata. Its content and structure require verification after download.
Malware Spllited info is a dataset hosted on Kaggle. Its specific content and structure are not detailed in the available metadata. The dataset likely contains information related to malware, possibly split across different categories or features.
Malware_dataset_grayscale likely contains image representations of malware samples. The dataset is hosted on Kaggle, but its size, author, and update history are unknown. Columns may suggest grayscale image data intended for machine learning tasks.
A Kaggle dataset titled RGB-MALWARE. Its content likely relates to malware analysis using RGB image representations. The dataset's author, organization, size, and temporal coverage are unknown.