Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,590 datasets
242,000 URLs have been processed with engineered features for phishing detection. The dataset likely contains attributes designed to distinguish malicious URLs from legitimate ones. Its origin, author, and specific feature definitions are unknown.
A medical image dataset likely containing annotations for seven anatomical structures: liver, kidney, hepatic vessel, pancreas, colon, lung, and spleen. The dataset was sourced from Kaggle, but the author, organization, and specific collection details are unknown. The last update date and dataset size are also unspecified.
SoloSpeak source code is available on Kaggle. The dataset's specific contents, such as the programming language, project size, and purpose, require verification after download. Metadata is minimal; details about the author, organization, and last update are unknown.
QuantFlow source code is a dataset published on Kaggle. The dataset's content and structure are not described in detail. Further verification is required to determine its specific contents and intended use.
A collection of documents related to the Committee of Twenty, a group formed in the 1970s to reform the international monetary system. The dataset is authored by Michael D. Bordo and is hosted on the paperswithcode platform. The specific content, format, and scale of the documents are not detailed in the available metadata.
"The Ghost in the Attic: The Soviet Union as a Factor in Anglo-American Wartime Postwar Planning for Postwar Germany, 1943-1945" is a historical paper published by Brill. It was presented at the International Committee for the History of the Second World War in San Francisco on August 26, 1975. The paper examines the influence of the Soviet Union on Allied planning for Germany's future during the final years of World War II.
American Jewry and the Holocaust: The American Jewish Joint Distribution Committee, 1939-1945 is a dataset published on paperswithcode. The dataset likely contains historical records or documents related to the American Jewish Joint Distribution Committee's activities during the Holocaust period. The dataset author is Yehuda Bauer.
U.S. Congress. Senate. Subcommittee on Africa produced a hearing transcript titled 'Trade Sanctions against Rhodesia: Hearing before the Committee on Foreign Relations, United States Senate. 96th Cong., 1st sess., June 12, 1979'. The dataset likely contains the full text of the congressional hearing, including statements, testimonies, and discussions. It was published on the paperswithcode platform.
Arthur Hendrick Vandenberg authored this collection of papers related to U.S. foreign policy toward China. The dataset likely contains textual documents from the critical post-World War II period. Its content is hosted on PaperswithCode, a platform for datasets associated with research.
A dataset related to intrusion detection systems using artificial intelligence. The dataset is hosted on Kaggle. Its specific content, size, and creation details are unknown.
A collection of emails labeled as phishing or legitimate, intended for training machine learning models. The dataset's origin, size, and specific features are not detailed in the provided metadata. Further inspection of the downloaded files is required to determine its full scope and composition.
Anonymized quantitative survey data on cybersecurity resilience factors from organizational stakeholders. It includes assessments of governance, internal controls, knowledge, attitudes, and perceived vulnerabilities. The data was collected by Raymond Friedman via the Comprehensive Cybersecurity Resilience Factors Assessment Survey for Stakeholders.
Source code for a tool named 'split_mask', authored by Steven Mocking of MGH CCNI Helper Programs. The code was last updated on March 18, 2026. Its specific functionality likely relates to image segmentation or mask processing.
Startup GitHub Engineering Velocity Panel tracks GitHub commit velocity across 55 venture-backed startups over five quarters. The dataset likely contains metrics related to developer activity and productivity over time. It was sourced from Kaggle, but the author, organization, and specific collection method are unknown.
A database of discrete capital investment projects from New York City's Capital Commitment Plan, uniquely identified by Financial Management Service (FMS) IDs. The dataset includes information on sponsoring and managing agencies, project descriptions, and financial commitments. It was last updated on December 10, 2025, and is hosted by data.cityofnewyork.us.
SongTonyLi published a dataset titled Cve Nvd on huggingface in March 2026. The dataset likely contains entries from the National Vulnerability Database (NVD), which is a repository of standards-based vulnerability management data. The platform tags suggest it is stored in Parquet format and optimized for use with libraries like pandas and polars.
Kaggle hosts the Unified Network Intrusion Detection Dataset (UNID). The dataset likely contains features for classifying network traffic as benign or malicious. Its specific size, origin, and update history are not detailed in the provided metadata.
CVE Decision is a dataset published on Kaggle. The title suggests it likely contains records related to Common Vulnerabilities and Exposures, a standardized system for identifying security flaws. Specific details on the number of records, columns, and collection methodology are unavailable from the provided metadata.
Phishing Website Detection Dataset Lakshay is a dataset for identifying malicious websites, hosted on Kaggle. The dataset's specific features, size, and collection methodology are not detailed in the provided metadata. Its content and structure require verification after download.
A collection of test cases published on Kaggle. The dataset likely contains examples and scenarios used for validating software functionality. The author, organization, and specific content details are unknown and require verification after download.