Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,651 datasets
A table of issues related to protected areas under the Water Framework Directive, produced for reporting purposes for the European Flood Directive. The dataset was created by the Bureau de Recherches Géologiques et Minières and last updated on October 24, 2018. It is used to produce maps of exposed issues at an appropriate scale to inform flood risk management plans.
Homogeneous areas describing a type of economic activity on an IRR, produced for reporting purposes for the European Flood Directive. The dataset is used to produce maps of exposed issues at an appropriate scale, contributing to flood risk management plans. It was produced by the Bureau de Recherches Géologiques et Minières and last updated on March 29, 2019.
A dataset from the Bureau de Recherches Géologiques et Minières (BRGM) used to produce maps of flood exposure for sensitive infrastructure. The data supports flood risk management plans mandated by European Directive 2007/60/EC and French national law. It was last updated on April 1, 2019.
A table of quantitative issues reported for each analytical grid and flood scenario, produced for reporting under the European Flood Directive. The dataset was created by the Bureau de Recherches Géologiques et Minières and last updated on March 29, 2019. It is used to produce maps of exposed issues at an appropriate scale, contributing to flood risk management plans.
European Directive 2007/60/EC mandates flood risk management plans to reduce impacts on health, environment, heritage, and economic activity. This dataset from the Bureau de Recherches Géologiques et Minières provides homogeneous zones describing economic activity types to map flood exposure issues. It was last updated on April 1, 2019.
Flood risk management plans required by European Directive 2007/60/EC aim to reduce negative consequences on health, environment, heritage, and economic activity. This dataset from the Bureau de Recherches Géologiques et Minières contributes to homogenizing knowledge of flood exposure for infrastructure. It is used to produce maps of exposed issues at an appropriate scale.
April 2019 data from the Bureau de Recherches Géologiques et Minières (BRGM) maps challenges for sensitive establishments and installations whose flooding may complicate crisis management. This dataset supports flood risk management plans by homogenizing knowledge of flood exposure, as required by European Directive 2007/60/EC and French national law. It is used to produce maps of exposed issues at an appropriate scale.
RepoBench v1.1 (Python) is a dataset for evaluating code generation models, derived from GitHub repositories. The collection spans from October 6th to December 31st, 2023, and has been deduplicated against the Stack v2 dataset to prevent data leakage. It was created by author 'tianyang' for the ICLR 2024 conference.
DiverseVul is a dataset of vulnerable source code for training deep learning models in vulnerability detection. The dataset was uploaded to Hugging Face by user 'claudios' on January 30, 2024. It originates from the research paper 'DiverseVul: A New Vulnerable Source Code Dataset for Deep Learning Based Vulnerability Detection'.
Records of green plantations designated for removal, based on inspection acts issued by the executive committee of the Pokrovsky district council in the city. The dataset is provided by the States site of Ukraine and was last updated on April 3, 2025. The specific number of records and temporal coverage are not detailed in the available metadata.
Tariffs for utilities approved by the Executive Committee of the Kamyanka City Council. The dataset was last updated on June 15, 2023 and originates from the States site of Ukraine. It is provided in CSV format.
100,000 to 200,000 verified smart contracts sourced from Etherscan.io and deployed on the Ethereum blockchain. The collection includes source code written in both Solidity and Vyper programming languages.
A subset of phishing site data originally sourced from Kaggle, used for a model compression example. The dataset was uploaded by user 'shawhin' to Hugging Face and last updated on September 1, 2024. It contains website URLs labeled as phishing (1) or not phishing (0).
Fortran 90 source code for computing seasonal fractional snow-covered area. The algorithm reads snow depth and snow water equivalent data from an example file and outputs fractional snow-covered area values. The code is hosted by ENVIDAT and was last updated in January 2021.
Reports on the implementation of budget programs for the Department of Education, Youth, Sports and National-Patriotic Education in Malyn, Ukraine. Each resource in the set is an archive of reports for the 2019 fiscal year. The dataset was published on the States site of Ukraine and last updated on the platform in February 2021.
MALS provides labeled data for detecting Advanced Persistent Threats. The dataset was uploaded by author 'tuandunghcmut' to Hugging Face in December 2024 after preprocessing into a standard format. Its original source is the APTM project codebase on GitHub.
A plan for the preparation of regulatory acts from the executive committee of Kamenetz-Podolsk City Council. The dataset includes project names, types of acts, adoption objectives, preparation timelines, and responsible bodies. It was last updated on January 4, 2024 and originates from the States site of Ukraine.
The Dresscode dataset, authored by JianhaoZeng, is a collection of fashion and clothing images. It was last updated on August 18, 2025, and is hosted on the Hugging Face platform. The dataset's specific size, row count, and license are not provided in the available metadata.
Financial statements for the Executive Committee of Sumy City Council, covering the budget of the Sumy city territorial community. The dataset was published on the States site of Ukraine and last updated on February 4, 2022. It likely contains detailed budgetary and expenditure data for the specified three-year period.
Ukraine's Boryspil City Council Executive Committee contracts and related financial documents. The dataset contains a list of concluded contracts, annexes, and other materials sourced from the Unified web portal of public finance use (spending.gov.ua). It was last updated on August 10, 2021.