Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
14,846 datasets
25 days of normal-operation data and 5 days per attack scenario were collected from a laboratory-scale 13-node three-phase unbalanced radial distribution network. The dataset includes additive, subtractive, and replay false data injection attacks, as well as denial-of-service attacks, with processed time-series and traffic features. It was created by Yulin Liu and last updated on 2026-04-12.
2023 estimates of forest aboveground biomass (AGB) for the state of Maine, USA. The data were generated using airborne LiDAR from the USGS 3DEP project and a deep learning CNN model calibrated with USDA FIA plot inventory data. The dataset provides a high-resolution (10-m) raster map projected to the year 2023.
Geoscience Australia produced digital bathymetry, gravity, and magnetic grids covering the southwest quadrant of Australia (24-46°S, 106-140°E). The results were obtained by performing a network adjustment on marine ship-track data and combining these with onshore and satellite-derived data. The work was done in cooperation with Desmond Fitzgerald & Associates and the Australian Hydrographic Office.
Lingzhou Zhao published a dataset on figshare in April 2026 detailing the development of a novel nanobody for radiotheranostics. The data likely contains results from in vitro and in vivo studies of the NB46 nanobody labeled with 68Ga and 177Lu, including affinity, biodistribution, and tumor growth inhibition metrics. The dataset is small, at 1.5 KB, and is provided in CSV format.
UK local authorities are required to publish details of all grants awarded to voluntary, community, and social enterprise (VCSE) organizations under the Local Government Transparency Code 2014. This dataset likely contains records of financial awards for the 2022-23 fiscal year, documenting public expenditure to support the third sector. Its cross-platform presence on UK and EU open data portals signals its importance for government transparency and accountability.
Daniel Nettersheim's research dataset contains proteomic and phosphoproteomic analyses of newly established primary human penile carcinoma cell lines and corresponding xenograft tumors. The data was generated using mass spectrometry and phospho-kinase arrays to identify therapeutic targets like heat shock proteins and histone deacetylases. It was last updated in April 2026.
Approximately 1 million academic papers from sources like arXiv and bioRxiv have been processed into a unified, multi-layered knowledge graph. The dataset, created by InternScience, decomposes each paper into five modules covering metadata, entities, abstracted knowledge, citation context, and fine-grained relations. It was last updated on June 12, 2026.
The Australian Ocean Data Network presents a geochemical analysis of uranium speciation in organic-rich marine sediments. The dataset contains results from high-energy resolution fluorescence detection x-ray absorption spectroscopy on 11 samples from the Cretaceous Toolebuc Formation. Findings were submitted to the 2023 Goldschmidt Conference.
Nine carbonaceous shale samples from Australia's Cretaceous Toolebuc Formation reveal that 20-30% of uranium persists as U(VI) even under anoxic conditions. Researchers characterized the samples using high-energy resolution fluorescence detection x-ray absorption spectroscopy and nanoscale secondary ion mass spectrometry. The Australian Ocean Data Network published the findings, which were presented at the 2023 Goldschmidt Conference.
Duricrust areas, defined as surface or near-surface regolith indurated by siliceous, ferruginous, or other cements, are mapped across the state of Victoria. The Geological Survey of Victoria collected this data, typically recording it in the field at 1:25,000 scale before preparing it for publication. This dataset forms part of a larger geological mapping collection that includes rock units, structural lines, and placer deposits.
Wilcoxon signed-rank test results comparing the GA-ResUNetGAN model with baseline models on the BraTS2023 validation set. The dataset, authored by Muthulakshmi Kirubakaran, contains statistical test outputs for 22 validation subjects. It was last updated on May 18, 2026, and is shared under a CC-BY-4.0 license.
This dataset documents informal generative AI use for health among Pakistan's digitally connected urban youth. It contains survey results from 1,240 participants and 20 interviews, collected by researcher Ahsan Mashhood and published in April 2026. It captures self-reported usage rates, demographic factors, and thematic interview insights.
Of 1,240 surveyed digitally connected urban youth in Pakistan, 69.0% reported using generative AI for health purposes. This dataset contains the adjusted odds ratios from multivariable logistic regression models analyzing predictors of this use. It was published by researcher Ahsan Mashhood in April 2026.
A multi-class video classification dataset for Vietnamese Sign Language, sourced from the AI Challenge PTIT website. The dataset is organized for a computer vision task and supports the Hugging Face Dataset Viewer via a metadata file. It was uploaded by author star092304 and last updated on 2026-05-31.
Great Lakes Guardian Community Fund Recipients lists organizations receiving grants for projects to protect and restore the Great Lakes. The dataset is provided by the Government of Ontario and was last updated on April 17, 2026. Grants of up to $25,000 were available to eligible not-for-profit organizations, First Nations, and Métis communities.
A list of Government of Ontario agencies, people, and private sector organizations that have created archival records. The dataset provides descriptions of the records and their creators, linking content creators to their records. It is published by the Government of Ontario under the OGL-CA-2.0 license and was last updated on April 17, 2026.
The Pechersk District State Administration in Kyiv publishes information on normative legal acts and draft orders. The dataset includes a link to related data on officials responsible for open data disclosure. It was last updated on 2026-04-29.
The Government of British Columbia provides a list of Ministry of Health - Vital Statistics purchasing card transactions for fiscal year 2011. The data reflects expenditures within the original ministry structure before reorganization events in October and March. The dataset is available in CSV and HTML formats.
nvidia/LIBERO_LeRobot_v3 is a LeRobotDataset v3.0 conversion of the LIBERO benchmark for lifelong robot learning. The dataset packages task suites with Parquet state/action data, MP4 video observations, and LeRobot metadata. It was last updated on 2026-05-31.
Tabulated detection results generated by the Single Shot MultiBox Detector (SSD) architecture. The 91.6 KB CSV file contains predicted bounding box coordinates, class labels, and associated confidence scores. Author Lin Lin uploaded the dataset under a CC-BY-4.0 license on figshare, with a last update timestamp of 2026-05-07.