Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,410 datasets
61,000 audio samples representing urban noise environments in Uganda. The dataset is designed for tasks such as noise classification and audio tagging and includes two configurations, large and small. It was published by Sunbird and described in a peer-reviewed data descriptor.
SharoonArshad processed the How2Sign dataset into a ZIP archive, last updated on 2026-05-17. The data consists of sequences of 450 features per frame, derived from holistic keypoints. The processed format combines 225 position and 225 velocity features from 75 body landmarks.
SynLayers training data contains layered image samples for computer vision tasks. Each row includes a base image, a composited whole image, separate layer images with captions and bounding boxes, and sanitized metadata. The dataset was created by SynLayers and was last updated on 2026-05-18.
Fisheries and Oceans Canada provides seasonal mean dissolved inorganic carbon concentration climatology for the Canadian Pacific Exclusive Economic Zone. The data were generated by averaging results from the British Columbia continental margin model (BCCM) over the 1993 to 2020 period. It includes raster layers at 3 km spatial resolution and 47 vertical levels from the surface to 2400 meters.
Seasonal mean dissolved inorganic carbon concentrations for the Canadian Pacific Exclusive Economic Zone, averaged over a 30-year period from 1981 to 2010. The data, produced by Fisheries and Oceans Canada using the British Columbia continental margin model, provides raster layers at 3 km spatial resolution and 47 vertical levels from the surface to 2400 meters. Spring, summer, fall, and winter climatologies are defined based on specific three-month groupings.
Information from 2007 on the organization of pension funds, provided by the Dutch Ministry of the Interior and Kingdom Relations. The data concerns the outsourcing of tasks and the internal organizational structure of these funds. The dataset is available under a CC-BY-4.0 license.
A replication package for the paper 'The Statehood Problem: Selection Bias and Exclusion in Democratic Transition Research'. The package contains all datasets and code required to reproduce the main results, figures, and tables from the study. It was authored by Emma Gordon and hosted on Harvard Dataverse, with a last recorded update in June 2026.
Quantitative evaluation results for computer vision models on an original dataset. The 9.5 KB Excel file contains metrics for bounding box and keypoint detection expressed as percentages, alongside inference time, best epoch, and training time. Authored by Avinash Bansal and last updated on April 21, 2026, it is shared under a CC-BY-4.0 license on figshare.
Sheila Timp published a dataset on figshare in May 2026. It lists the number of organizations and sickness absence cases, categorized by ISIC (International Standard Industrial Classification) sector. The dataset is a 5.5 KB Excel file.
A 1.5-year study in Greensboro, NC measured volatile organic compound (VOC) emissions from above-ground fuel storage tanks. The dataset, authored by Wyatt M. Champion, uses triggered canister samples and lower-cost sensor packages to speciate VOCs during high-concentration events. It was last updated on March 25, 2026.
RAEv2 Data provides pre-processed datasets and pretrained encoders for the RAEv2 research paper. The collection includes subsets like ImageNet-1k at 256x256 resolution, BLIP3o-captioned images, rendered-text images, and synthetic FLUX images. It was uploaded by nanovisionx and last updated on May 18, 2026.
Two manganese nodules recovered by HMAS Kimbla from a water depth of 4300 meters, about 250 nautical miles southeast of Sydney. The nodules are subspherical, about 10 cm in diameter, and have a high clay content, a low Mn:Fe ratio, and low contents of Ni (0.25%), Cu (0.17%) and Co (0.06%). The data is provided by Geoscience Australia and was last updated in April 2026.
Interball-1 satellite data processed to provide tri-axial magnetic field measurements in the solar wind at a consistent 60-second resolution. The dataset was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under NSF grants and has been used for superposed epoch and cross-correlation studies. Data is provided in GSE coordinates and stored in BIN and HTML formats.
NASA's ISEE-3 satellite data provides processed solar wind magnetic field measurements linearly interpolated to a 60-second resolution in Geocentric Solar Magnetospheric (GSM) coordinates. The dataset was constructed by Dr. J.M. Weygand for Prof. R.L. McPherron under NSF grants ATM 02-1798 and ATM 02-08501. It was primarily used for superposed epoch and cross-correlation studies on solar wind phenomena.
Historical predictions of E. coli levels and resulting swim advisories for beaches along Chicago's Lake Michigan lakefront. The Chicago Park District issued advisories when predicted levels reached at least 235 Colony Forming Units per 100 ml of water. This dataset is historical-only, with a newer series beginning in 2017.
Fisheries and Oceans Canada provides spatial data for the Northwest Atlantic Fisheries Organization (NAFO) Subareas, Divisions, and Subdivisions. The dataset is derived from the 2020 NAFO Convention and uses the NAD83 datum. It is intended for mapping and illustrative purposes, not for legal use.
Supplementary materials from a 2026 study detail a digenic mechanism for early-onset diabetes in a single family. Author Makie Honda provides a clinical characteristics table and three figures analyzing genetic variants. The 1.2 MB dataset includes a pedigree chart and structural analyses of HNF1A and ABCC8 gene variants.
Late spring data from St. Georges Basin, a coastal lagoon in southeastern Australia, examines benthic nutrient and gas fluxes alongside water column and sediment properties. The study investigates control mechanisms coupling benthic and pelagic processes, focusing on nutrient fractionation. It was contributed by the Australian Ocean Data Network and last updated in April 2026.
Diatom biomass sinking drives nutrient removal in St. Georges Basin, a coastal lagoon in southeastern Australia. The dataset contains measurements of benthic nutrient and gas fluxes, water column properties, and sediment characteristics from a late spring study. It was contributed by the Australian Ocean Data Network and last updated in April 2026.
NYC Open Data Coordinators (ODCs) are listed by last name, first name, and organization name. The dataset is published by data.cityofnewyork.us and was last updated on 2026-04-20. It serves as a directory for the point persons responsible for maintaining published datasets and their documentation.