Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,562 datasets
Quarterly performance data tracks the measures of success outlined in the London Borough of Barnet's Corporate Plan. The report covers the Q4 period of the 2013-14 fiscal year, detailing key performance indicators for council operations.
London Borough of Barnet's quarterly performance report for Q1 of the 2016-17 financial year, detailing the council's progress against its Corporate Plan measures of success. The dataset is produced by the London Borough of Barnet and was last updated in March 2026.
Kaggle hosts this dataset containing source code related to dengue fever. The dataset likely contains software scripts or analysis code accompanied by explanatory notes. The author, organization, and specific details about the code's purpose are unknown.
A report by a BMR committee detailing a forward marine program, published by Geoscience Australia. The content covers marine survey planning and Earth sciences topics. Specific data volume and structural details are unavailable.
CVE-list-v5-160526 is a dataset of Common Vulnerabilities and Exposures identifiers published on Kaggle. The specific version or date referenced in the title suggests it may contain records from a particular snapshot. The dataset's exact content, size, and origin are not detailed in the provided metadata.
The dataset title suggests data from 2019, specifically week 38. It likely contains records of commitments or pledges made by the event industry related to Sustainable Development Goals (SDGs). The data was published on Kaggle, but the original author and specific collection method are unknown.
A curated collection of URLs designed for training and evaluating machine learning models to detect phishing attempts. The dataset is hosted on Kaggle, but specific details about its size, origin, and creation date are not provided. Its primary purpose is to serve as a benchmark for building and testing automated phishing detection pipelines.
From October 2018, this dataset contains all supplier spending by Rochdale Borough Council, with prior records covering transactions of £500 or more. It is published to increase government transparency and openness, with personal data redacted to comply with the Data Protection Act. The dataset is maintained by Rochdale Borough Council and was last updated in March 2026.
Austin Resource Recovery 2024 Annual Report details departmental accomplishments for the year. The City of Austin uses this report to track progress toward a zero waste goal of reducing landfill trash by 90% by 2040.
LAMDA contains over 1 million feature-engineered Android APK samples collected by IQSeC-Lab for malware detection and concept drift analysis. The dataset spans a longitudinal period from 2013 to 2025, excluding 2015, providing a benchmark for temporal model evaluation.
This text dataset contains between 1 million and 10 million records of unsealed court filings, FBI reports, and DOJ publications related to Jeffrey Epstein. Curated by Nikity and last updated in February 2026, the collection aggregates official investigative materials from the U.S. Department of Justice and House Oversight Committee. The data is provided in Parquet format for high-performance processing.
Donor commitments data tracks pledged financial contributions for global health initiatives, measured in millions of constant 2009 US dollars. The dataset is published by the World Health Organization (WHO) as part of its Global Health Observatory. Specific temporal coverage and volume details are not provided in the metadata.
Commitments to recipient countries (Million, constant 2009 US$) is a dataset published by the World Health Organization (WHO) on the WHO Global Health Observatory platform. The data likely contains financial commitments for health and development aid, measured in millions of constant 2009 US dollars. The specific time range, recipient countries, and number of records are not provided in the available metadata.
Dataset.Phishing.Url is a collection of URLs posted on Kaggle. The dataset likely contains web addresses for the purpose of phishing detection research. The author, data source, and specific details about the collection are not provided in the available metadata.
Containing human-vetted malicious software packages discovered in real-world environments by DataDog security researchers. Updated as of March 2026, the data focuses on identifying and documenting software supply chain threats found in the wild. It serves as a verified ground-truth set for security tools and malware analysis.
A collection of malware binary files labeled as CUIP-X25, hosted on Kaggle. The dataset likely contains executable files intended for security analysis and machine learning model training. Metadata is minimal; specifics on the number of samples, collection period, and author are unknown.
PhiUSIIL_Phishing_URL_Dataset is a dataset of URLs related to phishing, likely collected for security research. It is hosted on Kaggle, but details on its size, features, and creation are unspecified. The dataset's content and structure require verification after download.
2000 JSONL entries (SCV-1 to SCV-2000) focus on advanced and unconventional smart contract vulnerabilities and attack vectors, with an emphasis on Decentralized Finance (DeFi) protocols. The dataset was created by author gayan2002 and last updated on Hugging Face in February 2026.
Mask2Former is a state-of-the-art architecture for panoptic and instance image segmentation. This dataset, published on Kaggle, likely contains the source code and model implementation files. The specific contents, such as training scripts or configuration files, require verification after download.
Douglas L Clarke's academic study argues the MIA issue was detrimental to U.S. interests, families, and foreign policy. The text likely contains a qualitative analysis of government actions, family impacts, and diplomatic consequences. Its source is a thesis or research paper, but the underlying data format and size are unspecified.