Loading...
Loading...
Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,558 datasets
Yukon's Brown McDade Mine is a gold-silver discovery in an unglaciated area between Whitehorse and Dawson. The thesis details its geologic structure, mineralization, and hydrothermal alteration, with drilling data from 1946 indicating a 1000-foot ore length. The resource is a digitized thesis from the University of British Columbia.
Howland Research Forest in central Maine has aboveground biomass estimates at 10-meter spatial resolution for 2012, 2015, 2017, 2021, and 2023. The dataset was created by ORNL_CLOUD using a random forest model calibrated with in-situ plot measurements and simulated biomass from the LANDIS-II model, applied to airborne LiDAR data from USGS 3DEP and NASA G-LiHT projects. Data are provided as cloud-optimized GeoTIFF files.
15 self-contained web applications with 110 documented vulnerabilities for evaluating real-world web exploitation. The benchmark includes target environments, exploits, reports, and verifiers, covering 0-day, 1-day, and synthetic vulnerabilities. It was created by AgentCyberRange and last updated on 2026-06-16.
A research dataset accompanies a proposed supervised heterogeneous Gaussian graphical model for analyzing data with continuous or discrete response variables. The dataset includes spectrometric data used to illustrate the model's performance, as published by author Xin Zeng in 2026. The total package is 2.2 MB, containing text files, a PDF, and compressed archives.
419 wolverine muscle samples were analyzed for total mercury and other metals to examine spatial variation in bioaccumulation across Arctic regions. The dataset, generated by Environment and Climate Change Canada, provides a baseline for investigating long-term impacts of atmospheric mercury deposition. A subset of animals also had metals measured in brain, liver, kidney, and hair tissues.
Evaluation reports from Global Affairs Canada reviewing the performance of programs and projects. The reports serve as a practical management tool, using gathered information to improve the design and implementation of upcoming initiatives. The specific report is a formative evaluation of the Canadian Francophonie Scholarship Program (CFSP) Phase VI - Development Component.
Urban planning data from the repealed Montreal Urban Plan regulation 04 047, now superseded by the Urban and Mobility Plan 2050. The dataset expresses building density across the city using a 17 color scale integrating various density parameters.
Two vector datasets provide generalized geographic and administrative basemaps for Quebec at 1:1,000,000 and 1:5,000,000 scales. The data includes major hydrographic groups, transport infrastructure, agglomerations, and administrative division limits, derived from a more detailed 1:250,000 scale source.
Two complementary studies evaluated a novel fluralaner tablet formulation in dogs. The pharmacokinetic study measured plasma concentrations in 12 dogs up to 49 days post-treatment, reporting a peak concentration of 3,763 μg/L and an elimination half-life of 17.88 days. The efficacy study, involving 12 dogs split into treated and control groups, demonstrated sustained flea and tick control above regulatory thresholds for 45 days.
The Havering Data Intelligence Hub provides data, information and research about the London Borough of Havering. Hosted by the Greater London Authority, it aims to benefit the local authority, its partners and the public in understanding key information about the borough. The dataset was last updated on 2026-06-24.
October 2025 polling data commissioned by the London Assembly, capturing Londoners' views on raising children in the city. The dataset was published by the Greater London Authority and last updated in June 2026.
Yukon's 2021-22 Mineral Exploration Program (YMEP) allocated $1.4M in funding. The Canadian Northern Economic Development Agency contributed an additional $80,000. In 2021, 76 applications sought over $2.4M, resulting in 52 funded projects across Grassroots, Focused Regional, Target Evaluation, and Placer modules.
Geoscience Australia Data provides a geological reconnaissance report on the Macdonald and Rawlinson sheet areas. The report describes an aggregate thickness of 27,000 feet of Precambrian sedimentary rocks, divided into Lower and Upper Proterozoic sequences. It was last updated on 2026-05-10.
Over 1,000 reported environmental releases are documented by the Connecticut Department of Energy and Environmental Protection from July 1, 2022 onward. Records include administrative details, spill source, chemicals involved, and location data. This dataset is a public snapshot from the state's live incident reporting system, updated approximately once a month.
64 female Sprague-Dawley rats were used in a DMBA-NMU induced breast cancer model to evaluate the effects of Pleurotus ostreatus extract and vincristine. The dataset likely contains measurements of antioxidant status, hormonal profiles, histopathology, and PI3K/Akt/mTOR pathway protein expression levels after 25 weeks. Magdalene Eno Udobi published the data on figshare in April 2026.
WeaveBench is a real-world benchmark for evaluating computer-use agents across hybrid interfaces. The dataset, created by wanlilll and last updated on June 5, 2026, assesses an agent's ability to orchestrate visual desktop control, command-line execution, code editing, browsers, and external tools within a single long-horizon workflow. The associated paper reports a best observed pairing of Claude Opus 4.7 + Claude Code achieving a 41.2% PassRate.
AgentCyberRange created PostExploitBench, a benchmark for multi-host post-exploitation tasks. The complete dataset contains 8 self-contained cyber ranges modeling enterprise-like environments with 156 target hosts. The dataset was last updated on June 16, 2026.
A structured dataset of formalizations from the Coq-HoTT library, which implements Homotopy Type Theory in the Coq proof assistant. The dataset contains 589 files sourced from a specific GitHub commit. It was created by user 'phanerozoic' and last updated on Hugging Face in June 2026.
A morphological study of living oak pollen from California conducted by researchers from the University of California, Berkeley and the University of Exeter. The dataset likely contains high-resolution images from scanning electron microscopes and light microscopes. The data is associated with an open-access research paper.
IHBench evaluates post-interruption recovery in voice agents executing structured, multi-step workflows. The benchmark contains 45 scenarios to measure if an agent resumes correctly, addresses user interjections, and avoids re-delivering content. It was created by bosonai and last updated on Hugging Face in June 2026.