Loading...
Loading...
Source code corpora, bug reports, vulnerability databases, network intrusion detection, malware samples
1,561 datasets
National Elevation Data Audit is a report outlining all elevation data available across all Australian jurisdictions. The audit was identified by the Intergovernmental Committee on Surveying and Mapping's (ICSM) Permanent Committee on Topographic Information (PCTI). The dataset is provided by Geoscience Australia Data and was last updated on 2026-04-20.
Canadian briefing material prepared for the Standing Committee on Official Languages appearance of Minister Miller on February 3, 2026. The document, published by Canadian Heritage, discusses the use of French in government communications. It was last updated on April 16, 2026.
A synthetic token-classification dataset created by enosislabs for training a local privacy and security filter. The dataset is designed to teach a filter to detect and redact sensitive information spans in developer workflows before data is sent to external large language models. It was last updated on Hugging Face on 2026 05 02.
A legacy report from the Bureau of Mineral Resources (BMR) committee outlining a forward marine program. The document is published by the Australian Ocean Data Network and is available in PDF and HTML formats. Metadata is minimal, with the abstract and data specifics marked as unavailable.
The Department of Education of the Pokrovsk City Council Executive Committee provides this set of budget program passports. Each resource in the set has a separate passport, likely detailing financial plans and objectives. The data was last updated on 2026-04-27 and is sourced from the States site of Ukraine.
Data on appeals received by the hotline of the Bolekhiv City Executive Committee in Ukraine. The dataset likely contains records of calls to municipal telephone services, emergency dispatch, or citizen contact centers. It was last updated on the platform in April 2026.
21.7 million rows of development metadata from 17 public GitHub repositories, fetched via the GitHub REST and GraphQL APIs. The data is structured across 8 tables covering issues, pull requests, comments, and other events, totaling 1.5 GB in compressed Parquet format. It was created by open-index and last updated on 2026-04 10.
A 2026 resource from the States site of Ukraine contains data on the regulatory framework of the Department of Urban Development and Architecture of the Executive Committee of the Rivne City Council. It includes the name, adoption date, adopting body, and a link to each published normative act.
Pretokenized chunks of text formatted as packed sequences of 513 tokens each, with no cross-document bleeding. The dataset was created by Beetle-Data and its metadata was last updated on May 18, 2026. It is sharded incrementally, with a marker file committed upon finalization.
Statistical information on the receipt of citizens' appeals to the executive committee of the Glukhiv City Council and requests for public information. The data reports on the results of their consideration and the implementation of the executive committee's work plan. It originates from the States site of Ukraine and was last updated on 2026-04-17.
Australia's authoritative gazetteer provides over 300,000 approved place names for the mainland, external territories, and offshore areas to the 3-mile marine limit. The 2010 release was compiled by Geoscience Australia's Geospatial and Earth Monitoring Division on behalf of the Committee for Geographical Names in Australasia. Data is sourced from Australian state, territory, and federal government agencies.
Multiple Ukrainian city councils, including Burshtyn and Kamianka, publish registers of citizen appeals received via telephone hotlines and social departments. These datasets likely contain counts and results of appeal considerations, offering a ground-level view of local governance and public service responsiveness. Their cross-platform presence indicates a standardized municipal reporting practice for citizen feedback.
An Australian National Spectral Database library containing spectral signatures of aquatic substrates. The data was collected for the Adelaide Coastal Waters Study and documented in a 2007 technical report by Blackburn and Dekker. Access is managed through the Australian Ocean Data Network and the Commonwealth Scientific and Industrial Research Organisation.
A 2001 spectral library for aquatic substrates in Bolivar, part of the Australian National Spectral Database. The data was used in a 2007 remote sensing study of marine and coastal features for the Adelaide Coastal Waters Study. It is hosted by the Australian Ocean Data Network and last updated in April 2026.
Performance metrics for a Deep Reinforcement Learning (DRL)-based network intrusion detector evaluated on the NF-BoT-IoT dataset. The dataset is a 5.5 KB Excel file authored by Khorshed Alam and last updated in April 2026. It is licensed under CC-BY-4.0.
Khorshed Alam's 9.5 KB Excel file compares methods for classifying network intrusions. The dataset likely contains key features used to distinguish between normal and malicious network activity. It was last updated on April 16, 2026.
Khorshed Alam published a dataset on hardware requirements for intrusion detection systems using deep reinforcement learning (DRL) on figshare in April 2026. The dataset is a 5.5 KB Excel file, suggesting a small, focused collection of specifications or performance metrics. It is licensed under CC-BY-4.0 for open use.
Documents prepared for the Deputy Minister/President of Canada Economic Development for Quebec Regions for an appearance before the House of Commons Standing Committee on Official Languages on November 1, 2023. The dataset is provided by the Canada Economic Development for Quebec Regions organization and was last updated on the open_canada platform in April 2026. The content is available in HTML format under the OGL-CA-2.0 license.
A dataset curated for the TabArena Tabular ML IID Study, intended for evaluating predictive models on independent and identically distributed tabular data. The data originates from a 2012 IEEE study on phishing website features and is licensed under CC BY 4.0. All features are categorical, ordinal-encoded variables with at most 3 values.
Data and source code from a study on soil pH as an external filter shaping insectβmicrobe gut symbiosis. The dataset is 103.7 KB in size and was last updated on 2026-04-21. It was authored by Hideomi Itoh and is shared under a CC-BY-4.0 license.