Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,506 datasets
A two-tiered README framework developed by a working group from seven Canadian universities. The framework defines nine minimum and thirteen enhanced metadata elements to standardize documentation for research data deposits in the Borealis repository. The work involved analyzing approximately 20 existing README templates from Canadian and international sources using AI-assisted analysis and manual quality assurance.
health.data.ny.gov provides monthly snapshots of enrollments in the New York State Donate Life Registry from September 2008. The dataset includes county-level population estimates and enrollment percentages. It is a confidential database used by donation professionals to confirm a person's consent status.
Replication files for analyses in the Political Communication article 'Democratic Institutions Under Threat'. The data was authored by Justin Wedeking and is hosted on the Dataverse platform. The dataset was last updated on June 8, 2026.
UFOCR is an open dataset of declassified records from the FBI and U.S. Department of War on UFOs, UAPs, and extraterrestrial investigations. The original government archives, which include scanned typewriter pages and handwritten notes, have been parsed into structured text using Reducto. The dataset was last updated on Hugging Face by the user 'reducto' on May 11, 2026.
A systematic narrative review synthesizing 70 peer-reviewed studies published between 2010 and 2026 on conflict-competent leadership. The dataset, authored by Gabriel Osei Forkuo and available on figshare, includes text and tabular files totaling 6.4 MB. It was last updated on April 17, 2026.
Long-term Industry Projections for a 10-year time horizon are provided for the state and 10 labor market regions. The dataset includes columns such as Area, Industry Title, Base Year Employment Estimate, and Projected Year Employment Estimate. It was published by data.ny.gov and last updated on 2026-04-15.
An original survey of Communal Council participants and 51 interviews with government officials and grassroots members support a study of Venezuela's Consejos Comunales. The dataset, by Jose Morales-Arilla, was last updated on May 1, 2026, and is hosted by the American Journal of Political Science (AJPS) Dataverse. It underpins a theoretical and empirical analysis of how captured participatory institutions affect development and politics.
1959 geophysical survey data from the Mount Morgan area in Queensland, Australia, focused on underground water resources. The dataset is a legacy product from Geoscience Australia with no abstract available, and its specific contents require verification after download.
MNR administrative boundaries divide Ontario into organizational units for managing ministry programs and resources at district and regional levels from 1997 to 2022. The Government of Ontario created and maintained this data, which is no longer being updated. It includes regional, district, and area boundaries originally defined by metes and bounds, topographic features, and geographic townships.
Bristol Channel and South Wales records of the invasive tube-building polychaete worm Ficopomatus enigmaticus. The dataset describes laboratory protocols developed for maintaining broodstock, spawning, larval culture, and settlement bioassays. The data was aggregated by the Government Digital Service from the eu_open_data platform.
The Tibetan Plateau is the focus of this dataset containing measurements of particulate organic carbon and mineral-associated organic carbon in soils. It originates from long-term warming experiments, was authored by Siyi Sun, and was last updated on April 20, 2026. The data is provided in a 3.8 KB CSV file.
Nine sites across Scotland's River Tay catchment provide monthly measurements of dissolved organic carbon, dissolved inorganic carbon, nutrients, and greenhouse gases (CO2, CH4, N2O). Sampling occurred from February 2009 to December 2010, with sites co-located at existing Scottish Environment Protection Agency (SEPA) monitoring stations. The dataset offers a spatially and temporally resolved snapshot of carbon and nutrient dynamics in a major UK river system.
413 individual HeLa cells were imaged to create this multimodal dataset for cytoskeletal analysis. The dataset comprises z-stacks across reflection interference contrast microscopy (RICM), brightfield, widefield fluorescence, total internal reflection fluorescence (TIRF), and confocal microscopy. It was curated by Carleton CTELab and last updated on 2026-05-25.
Dataset contains 10.000 Indonesian Wikipedia article summarization pairs. Each entry has an instruction (input) and a summary (output) in Indonesian. Designed for training/evaluating Indonesian LLMs.
Validated satellite image patches for detecting military vessels, exported from a review workflow. The dataset includes image patches, metadata, and object annotations in multiple formats like CSV, JSONL, and YOLO. It was created by DefendIntelligence and last updated on 2026-05-01.
2022 onward data collected by Slocum Gliders in the North Sea near the JONSIS line east of Orkney. The British Oceanographic Data Centre (BODC) manages the data, which includes pressure, temperature, conductivity, salinity, water velocities, depth, and engineering variables. Data are provided in near-real-time, recovery, and quality-controlled delayed mode versions by the UK Met Office and National Oceanography Centre.
A review of lithology, sedimentary structures, and palaeontology subdivides the Triassic system in the Canning Basin into four environmental episodes. The sequence records a transgression culminating in the Smithian, followed by a regression with minor marine incursions, and a trend to increasing aridity correlated with the withdrawal of the Blina Sea. The dataset is provided by Geoscience Australia Data and was last updated on 2026-03 25.
A register of Ukrainian enterprises permitted to act as customs brokers, maintained by the States site of Ukraine. The dataset includes enterprise names, identification codes, authorization numbers, statuses, and the number of authorized agents. The record was last updated on April 28, 2026.
Southern Tasman Sea data details two manganese nodules and sediment samples collected during a 1979 geological cruise aboard HMAS Kimble. The dataset includes geochemical analysis of the nodules, reporting nickel, copper, and cobalt content, and describes sediment types from five sampling stations. It provides bathymetric context with depths ranging from 4300 to 5100 meters near the Gascoyne Seamount.
NMR spectra data supporting a thesis on catalytic organic transformations using covalent organic frameworks. The data includes ยนH and ยนC NMR spectra for linkers, photocatalysts, and products from reactions like [3+2] annulation and [4+2] cyclization. The dataset was authored by Sheung Chit Cheung and last updated on May 19, 2026.