Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,561 datasets
Pretokenized text chunks formatted as packed sequences of 513 tokens each, with no cross-document bleeding. The dataset was created by Beetle-Data and its finalization marker was committed on May 18, 2026. It is hosted on the Hugging Face platform.
North Lincolnshire Council's local development plans for committed industrial land use, represented as geospatial polygons. The dataset is provided by the Government Digital Service via the eu_open_data platform. The specific scale, update frequency, and temporal coverage are not detailed in the available metadata.
Ontario government data tracks service delivery performance across channels like telephone, in-person visits, email, and websites. The dataset includes results estimated through sampling and mystery shopping where direct tracking was unavailable. It was last updated in March 2026.
Original data supporting a quantitative analysis of joint bodies in preferential trade agreements, published by Markus Gastinger and Andreas Dür in 2026. The dataset likely contains variables measuring the institutionalization of joint committees, such as meeting frequency and decision-making powers. It was used to test arguments about monitoring commitments, facilitating negotiations, and avoiding domestic ratification.
A 2025 briefing document prepared for the Minister of Pacific Economic Development Canada for a Committee of the Whole session. The document was published by Pacific Economic Development Canada on the open_canada platform and last updated on April 9, 2026. It likely contains background information, analysis, or policy positions relevant to the committee's discussions.
Materials from the appearance of Canada's Treasury Board President before the House of Commons Committee of the Whole in June 2025. The dataset covers discussions on the Main Estimates and Supplementary Estimates (A) for the 2025-26 fiscal year. It was published by the Treasury Board of Canada Secretariat and last updated on April 9, 2026.
New Orleans Redevelopment Authority's inventory lists properties not under contract. The dataset includes property addresses, zoning classifications, and council districts. It is published by data.nola.gov and was last updated in April 2026.
A government report from the BMR committee outlining a forward marine program, published via the Australian Ocean Data Network. The report is available in PDF and HTML formats. Metadata is minimal, with the raw description stating 'Legacy product - no abstract available'.
Records from 2007 to 2015 detailing recipients of the Innovation Demonstration Fund, a green technology pilot program paused in 2013. The dataset includes the funding program, company name, location, fiscal year in which the contract was signed, and government funding commitment.
Test cases for intelligent parsing of scientific literature, published on figshare by Yang Yuan. The dataset is 172.9 MB in size and includes files in JSON and PDF formats. It was last updated on April 16, 2026.
A dataset containing Turkish language question-answer pairs about automobile maintenance, faults, and problems. The data is provided in Parquet format. The author notes that the dataset may contain errors and gaps, and contributions are accepted via pull requests.
Farideh Motaghian's Python source code package, uploaded to figshare on 2026-04-10. The 138.0 KB ZIP file includes model architecture, data preprocessing scripts, and evaluation metrics for a machine learning project.
A 2001 spectral library from Bolivar, Australia, containing reflectance data for aquatic substrates. The dataset is part of the National Spectral Database and was contributed by the Australian Ocean Data Network. It supports remote sensing studies of marine and coastal environments.
Adelaide coastal waters spectral data from a 2003 study supports remote sensing of marine environments. The dataset is part of the Australian National Spectral Database, managed by the Australian Ocean Data Network. It was created by Blackburn and Dekker for the Adelaide Coastal Waters Study, with a final technical report published in 2007.
A report from an open meeting of the JOIDES planning committee held in Zurich in September 1973. The document discusses the future of the Deep Sea Drilling Project after 1975. It is published by Geoscience Australia Data and was last updated on the platform in April 2026.
This dataset tracks cash and in-kind contributions in Washington State, from the 2013 general election onward, that were reported within 7 days of a primary or 21 days of a general election. It includes contributions made to candidates and political committees. The column names suggest detailed tracking of contributors, recipients, amounts, and election context.
Information about the members of the executive committee of the Burshtyn City Council. The dataset originates from the States site of Ukraine and was last updated on 2026-04-17. It is available in CSV format.
Parks Canada prepared a briefing package for its President and CEO to appear before the Standing Committee on Canadian Heritage on September 24, 2025. The document is available in PDF format under the OGL-CA-2.0 license and was last updated on the open_canada platform in April 2026.
A 2008 report outlines all elevation data available across Australian jurisdictions. The audit was identified by the Intergovernmental Committee on Surveying and Mapping's Permanent Committee on Topographic Information. The dataset is provided by the Australian Ocean Data Network.
A dataset collects function-level information from ArkTS (HarmonyOS Ark TypeScript) projects. It includes original functions, docstrings, abstract syntax tree representations, obfuscated versions, and source code metadata. The dataset was created by author hreyulog and last updated on April 15, 2026.