Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,560 datasets
Global Affairs Canada published the Deputy Minister of International Development's appearance before the Standing Committee on Foreign Affairs and International Trade. The testimony occurred during International Development Week and addresses the current global climate. The dataset was last updated on May 12, 2026.
2,393 pull requests from the OpenHands platform, of which 1,737 were merged, spanning 667 unique repositories. The dataset includes activity metadata such as repository state snapshots and modified-file records, with a total of 14,186,956 lines added and 1,732,609 lines deleted. It was authored by inaesh-joshi and last updated on June 2, 2026.
Funding decisions, committed amounts, and disbursements for the FCC's Rural Health Care Program, collected from self-reported FCC Forms 462, 466, and 466A. The data includes details on applicants, service providers, funding years, and geographic locations of participating health care providers. It is published by datahub.usac.org and was last updated in April 2026.
Fisheries and Oceans Canada applied a national vulnerability assessment framework to marine biota in the Maritimes Region. The research document, 230 pages long and published in 2024, details the adaptation of the framework to create regional biological sub-groups and score them against exposure, sensitivity, and recovery criteria. This analysis produced a ranked list of sub-groups most vulnerable to ship-source oil spills to inform regional spill response strategies.
A 1.2 GB dataset of mature open-source Java projects from the Apache Software Foundation, selected for their diverse use of third-party libraries. It includes source code, dependency information, and GitHub pull request discussions, with static analysis results from PMD and Designite to capture code- and design-level technical debt indicators. The dataset was created by Ciprian-Viorel Stupinean and last updated on 2026-04-25.
A dataset from the Dutch Ministry of the Interior and Kingdom Relations concerning national parks in the province of Drenthe. It includes borders for the Dwingelderveld, Drents-Friese Wold, and Drentsche Aa parks, as well as the border of the Beek and Esdorpen landscape. The data also contains the opinion of the Drentsche Aa Committee on national parks with broadened objectives, commissioned by Secretary of State Ms G. Faber.
North America continent is the focus of this simulated rasterized water surface elevation and inundation-extent product for the Surface Water and Ocean Topography (SWOT) mission. It is a derived product, resampling upstream pixel cloud data onto a uniform grid at 100 m and 250 m resolutions in a UTM projection. The dataset is described as a simulated product and not suited for scientific exploration.
Leicester City Council's pay structure for staff covered by national joint councils, applying to most non-school staff. The table details a 15-grade pay scale with an annual salary range from Β£20,258 to Β£71,957 as of 1 April 2022. The dataset is provided by the Government Digital Service via the EU Open Data platform.
Over 300,000 authorized place names for the Australian mainland, its external territories, and offshore areas up to the 3-mile marine limit. The Gazetteer of Australia is compiled by Geoscience Australia's Geospatial and Earth Monitoring Division on behalf of the Committee for Geographical Names in Australasia. Data is sourced from Australian state and territory jurisdictions and federal agencies, with a release noted from 2010.
A statutory UK government register tracks brownfield sites suitable for housing. The data likely contains site details, planning permission status, and deliverable status, including completed sites. The Government Digital Service organized this data to support a 2020 target for 90% of suitable sites to have housing planning permission.
JNCC's public inventory lists data assets held by the UK's Joint Nature Conservation Committee, published in accordance with a government transparency agenda. The workbook, based on an extract from the Topcat metadata catalogue, is intended to encourage enquiries about data availability. It explicitly excludes datasets containing personal data, most third-party licensed data, and internal administrative or document-based reports.
Los Angeles Police Department arrest incidents from 2020 to April 30, 2025, transcribed from original paper reports. The dataset includes 30 columns such as Age, Arrest Date, Charge, Location, and Council Districts. It is published by data.lacity.org and was last updated on March 4, 2026, but is now a static historical record as LAPD transitions to a new NIBRS-compliant reporting system.
UK government data details spending on agency and consultant costs by directorate for the 2014/15 fiscal year. It constitutes the raw data behind a table published in the Performance and Contract Management Committee report. The dataset is hosted on multiple open data platforms, indicating its public importance for financial transparency.
Australia's Intergovernmental Committee on Surveying and Mapping compiled a report cataloging all elevation data available across the nation's jurisdictions in 2008. The audit was conducted by the ICSM's Permanent Committee on Topographic Information. The report is provided by Geoscience Australia Data.
A 92.4% reduction in inference latency was achieved by a student model with only 4.8 million parameters, enabling real-time processing. The framework introduces TS-GAN, a temporally-aware generative adversarial network, to align feature distributions between source and target domains for cross-domain adaptation. In a transfer task from UNSW-NB15 to CIC-IDS2017, the model achieved a 90.13% F1-score and demonstrated resilience under 15% feature perturbation.
Videos of decision-making body sessions lists recorded sessions of Montreal's political authorities. The dataset is published by the Government and Municipalities of QuΓ©bec under a CC-BY-4.0 license. It was last updated on 2026-04-17.
A spectral library of aquatic substrates from the Adelaide Coastal Waters, collected in 2003. The data is hosted in the National Spectral Database (NSD) and was produced by David Blackburn Environmental Pty Ltd and CSIRO Land and Water for the Adelaide Coastal Waters Study. The final technical report was published in July 2007.
The MNCR Area Summaries dataset compiles survey data on habitats and communities within fourteen marine inlets in south-west Britain. The core data originates from the Harbours, Rias and Estuaries (HRE) programme conducted by the Field Studies Council Oil Pollution Research Unit between 1985 and 1989. This information, combined with other sources, was analyzed by the Joint Nature Conservation Committee to classify marine biotopes and describe their distribution.
A 2003 spectral library of aquatic substrates from Adelaide coastal waters, hosted in the National Spectral Database. The data was collected for a remote sensing study of marine and coastal features and changes related to natural and anthropogenic processes. The final technical report was prepared for the Adelaide Coastal Waters Study Steering Committee in July 2007.
Management Plans provide a simpler alternative to Forest Plans for woodlands under 100 hectares, required for SRDP grant eligibility. The dataset contains attributes such as plan reference numbers, dates, agreed area, conservancy, and grid references. It is provided by the Government Digital Service.