Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,560 datasets
Washington State campaign expenditures from candidates and political committees over the last 10 years. The Washington State Public Disclosure Commission compiled the data from legally mandated reports including forms C3 and C4. Records include in-kind contributions and are updated as of April 2026.
Cash and in-kind contributions reported to the Washington State Public Disclosure Commission for candidates and political committees over a ten-year period. The dataset includes contributions from forms C3, C4, and Schedule C, but excludes mini-reporting filers and paid or forgiven loans. Data is provided by the Washington State Public Disclosure Commission and was last updated in April 2026.
A collection of security scenario files authored by Vyber07, last updated on 2026-05-31. The dataset includes dedicated sections for AI/ML security and API security, each containing multiple JSON files with real-world examples and real-time scenarios. The structure suggests a focus on practical, documented security incidents and attack patterns.
A metadata-only dataset of GitHub commits designed for large-scale AI and software engineering research, created by adhyanshaa and last updated on June 5, 2026. It aims to solve the problem of training models on open-source software history without managing large volumes of raw code. The dataset is accompanied by the GitScope CLI tool for exploration.
Public Services and Procurement Canada provides briefing materials prepared for the Minister's and Deputy Minister's appearances before the Standing Committee on National Defence. The materials are available in HTML format and were last updated on 2026-05-19. The dataset is published under the OGL-CA-2.0 license.
August 2014 to May 2016 is the time period covered by this dataset, which records the attendance of Members at committees. The data is hosted on both EU and UK government open data platforms, indicating its relevance for cross-jurisdictional transparency. Columns likely contain identifiers for members, committees, dates, and attendance statuses.
Ramsar sites are wetlands designated under the international Ramsar Convention. The dataset, provided by the UK Government Digital Service, includes sites designated since the UK's first in 1976, with an initial emphasis on areas important for waterbirds. The data is available in geospatial formats like GeoJSON and ESRI Shapefile.
The Australian Stratigraphic Units Database (ASUD) is the national authority on stratigraphic names in Australia, containing about 17,500 currently approved names and over 36,000 variations. This information is based on over 16,000 published references and is maintained by Geoscience Australia on behalf of the Australian Stratigraphy Commission. The database originated in 1949 and has been maintained electronically since 1979.
Public Services and Procurement Canada provides monthly commitment trackers and supporting documents detailing progress on HR and pay services. This collection addresses both current operations, focused on maintaining the existing pay system, and transformation initiatives exploring AI and new technology to replace the Phoenix pay system and 30 HR systems. Data for the latest quarter is available online, with commitments reviewed and adjusted annually.
Global Affairs Canada recognized its Ukraine team with the 2022 Award of Excellence for the humanitarian, development and peace nexus. The dataset, published on the open_canada platform, documents this employee recognition. It was last updated on 2026-05 08.
An aquatic substrate spectral library hosted in the National Spectral Database. The data was collected for the Adelaide Coastal Waters Study in 2003 and cited in a final technical report published in 2007. The dataset is managed by Geoscience Australia Data.
The Aquatic Substrate Library - Bolivar 2001 is a spectral dataset hosted in Australia's National Spectral Database. It was created by researchers from David Blackburn Environmental Pty Ltd and CSIRO Land and Water for a remote sensing study of marine and coastal features. The data was referenced in a 2007 technical report for the Adelaide Coastal Waters Study.
10,000 supervised examples for classifying email and email-adjacent content. Each JSONL row contains an instruction plus text input and a structured JSON output with fields for triage, priority, and risk. The dataset was created by weijianzhg and was last updated on HuggingFace in May 2026.
Briefing Materials prepared for the Ministerβs appearance before the Committee of the Whole. The materials are published by Public Services and Procurement Canada under the OGL-CA-2.0 license. The dataset was last updated on 2026-05-21.
Briefing materials prepared for the Minister of Public Services and Procurement Canada's appearances before the Standing Committee on COVID-19. The dataset consists of HTML documents published by the Canadian government under the OGL-CA-2.0 license. It was last updated on May 21, 2026.
The Gazetteer of Australia is the authoritative data source for approved place names across Australia's mainland, external territories, and offshore areas. Compiled by Geoscience Australia on behalf of the Committee for Geographical Names in Australasia, the 2010 release consists of over 300,000 place names. Data is sourced from State and Territory jurisdictions and Australian Government agencies.
The Peace Agreement Amnesties Dataset documents provisions for amnesty or pardon within formal peace agreements. It contains data on 117 peace agreements providing amnesty in relation to internal armed conflicts from 1990 to 2024 across 47 countries. The dataset was created by Louise Mallinder of PeaceRep Data, derived from the PA-X Peace Agreement database.
A government report from the Bureau of Mineral Resources (BMR) committee outlining a forward marine program. The report is published by the Australian Ocean Data Network and was last updated on 2026-06-04. The content is available in PDF and HTML formats.
The Executive Level Union-Management Consultation Committee on Phoenix was created to discuss issues with the Phoenix pay system. Its mandate is to identify problems and recommend solutions, including lessons learned. The committee meets quarterly and is co-chaired by the Secretary of the Treasury Board of Canada Secretariat and the National President of the Public Service Alliance of Canada.
A replication package provides data and scripts for an empirical study on the relationship between architectural smells and mock-related test quality. The package is authored by Paloma Passos and was last updated on May 17, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.