Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,591 datasets
Tracking data from the 1990s onward, compiled by the Scientific Committee on Antarctic Research (SCAR) project RAATD. It consolidates movement records for 17 species of Antarctic meso- and top-predators, including birds and marine mammals. The provided data files contain filtered position estimates processed with a state-space model for regular time intervals.
Kaggle hosts a dataset titled 'Malware Classification', likely containing features for categorizing malicious software. The dataset's specific size, origin, and update history are not provided in the available metadata. Its content and structure require verification after download.
A collection of website screenshots labeled as phishing or legitimate. The dataset is hosted on Kaggle, but the total number of images, creation date, and author are unknown. The content likely consists of visual captures of web pages intended for security analysis.
Global Cybersecurity Index data published by the World Bank and International Telecommunication Union (ITU). The dataset likely contains country-level metrics for assessing cybersecurity commitment and capabilities. It covers the period from 2020 to 2024.
A dataset published on Kaggle covering cybersecurity topics from 2020 to 2024. The specific content, such as network logs, attack signatures, or vulnerability reports, is not detailed in the available metadata. The dataset's author, organization, and exact size are unknown.
A dataset for Android malware analysis, published on Kaggle. The source description calls it the latest version of the dataset. Its size, origin, and collection methodology are not provided in the available metadata.
2.2 million Android application records sourced from the Google Play Store, detailing the specific permissions requested by each app. The dataset provides a large-scale view of the security and privacy permissions that Google Play apps request.
Verified phishing URLs and malicious links categorized by 'Valid' status from the PhishTank database. These real-world examples support the training and evaluation of anti-phishing detection systems.
Android malware samples likely collected in 2020, as suggested by the dataset's codename. The dataset is hosted on Kaggle and is tagged for machine learning and cybersecurity applications. The specific author, collection method, and data volume are not provided in the available metadata.
Replication materials for the study 'Observing the Unobservable: Wargaming Cyber Deterrence' by Andrew Redding. They support replication of the analyses and findings in the associated 2026 publication.
86% of dated volcanic samples from Mt. Waesche correlate with interglacial periods, suggesting a strong link between ice sheet changes and volcanic activity. This dataset integrates petrologic records, 40Ar/39Ar eruption ages, and geodynamic modeling to study glaciation's effects on crustal stress. It was published by AMD_USAPDC via NASA EarthData and was last updated in July 2024.
2013-2017 voting records for motions at Edmonton City Council and selected committee meetings. The data likely contains votes cast by individual councillors, recorded as 'in favour', 'opposed', or 'absent'. It is provided by data.edmonton.ca and was last updated in May 2022.
2011-2013 voting records for motions at Edmonton City Council and selected committee meetings. Each row records a VOTER's VOTE ('in favour', 'opposed', or 'absent') linked to a MOTION_ID, ITEM_ID, and MEETING_ID. It is provided by data.edmonton.ca and was last updated in May 2022; the City Clerk's approved documents remain the official record.
A National Vaccine Advisory Committee report outlines quality standards and guidance for evaluating adult immunization programs in nontraditional settings. The document likely contains policy recommendations, evaluation frameworks, and programmatic guidance. It was published on the paperswithcode platform, but its original publication date and author details are not provided.
Kaggle hosts a dataset focused on phishing and spam content. The dataset's specific size, features, and collection method are not detailed in the provided metadata. Its author, organization, and last update date are currently unknown.
The Physical Activity Guidelines Advisory Committee Report, 2008 to the Secretary of Health and Human Services is a formal advisory document. It likely contains scientific reviews and policy recommendations related to physical activity and public health. The dataset is sourced from the paperswithcode platform.
CRAVE provides 1,200 high-quality samples for code review classification, released by TuringEnterprises in late 2025. The data covers 600 pull requests across 123 distinct repositories, focusing on binary classification of code changes.
38,467 records representing approximately 30,000 unique binary executables totaling ~33.41 GB. The dataset was created by mjbommar and last updated on November 14, 2025. It is designed for machine learning research in binary analysis, malware detection, and program understanding.
A curated multi-source corpus of phishing emails focused on threats to the education sector. The dataset is hosted on Kaggle and appears designed for security and NLP tasks. Specific details on size, authorship, and licensing are not provided in the input metadata.
Curated by 0xh3xa and updated through January 2026, this repository aggregates links to malware and benign datasets for cybersecurity research. It organizes resources across Windows and Android platforms to facilitate malware classification and deep learning applications.