Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,993 datasets
Southern Caribbean Sea measurements of oceanic methane (CH4), carbon dioxide (CO2), and ammonia (NH3) from the Cariaco Basin. Data collection began in November 1995 and is ongoing, conducted by researchers Mary Scranton, Yrene Astor, and Kent Fanning as part of the CARIACO project.
A continuously-updated compilation of seafloor topography derived from multibeam bathymetry data, merged with land topography from the Shuttle Radar Topography Mission. The synthesis began as the Ridge Multibeam Synthesis and has been expanded to include global and coastal ocean data. It is maintained by SCIOPS as a multi-resolution gridded digital elevation model.
Aqua MODIS Level-3 data provides global mapped concentrations of particulate inorganic carbon (PIC), primarily calcite from coccolithophores. The dataset is produced by OB_CLOUD using the Balch & Gordon algorithm applied to ocean-color reflectance. The version is 2022.0.
2022.0 version data provides global measurements of particulate inorganic carbon (calcite) concentration in the ocean surface. The dataset is produced by the OB_CLOUD organization using the Balch & Gordon algorithm applied to Aqua MODIS satellite ocean-color reflectance. It supports analysis of coccolithophore blooms and marine carbonate cycling.
UEX-PVSeg is a benchmark dataset for photovoltaic panel segmentation in remote sensing imagery. It was established to facilitate rigorous evaluation of computer vision models on PV interpretation tasks. The dataset is a key contribution of the PANEL paper and was last updated on HuggingFace in March 2026.
Annotated images of small cola bottles. The dataset is intended for quality control applications. The author, organization, and specific size are unknown.
JAX and NYC datasets for training and evaluating Skyfall-GS, a hybrid framework for synthesizing city-block scale 3D urban scenes. The data was uploaded by author jayinnn and last updated on March 18, 2026. The datasets combine satellite reconstruction with diffusion refinement.
Fiscal year-end headcount data for full-time and full-time equivalent employees across Mayoral Agencies and Covered Organizations. The dataset includes breakdowns by agency, personnel type, and funding source, and is updated twice annually. It is published by the City of New York via data.cityofnewyork.us.
A pre-trained ResNet-152 version 2 model, likely for image classification tasks. The dataset is hosted on Kaggle, but its specific origin and creation date are unknown. The model's architecture suggests it was trained on a collection of images, possibly for a specialized domain.
Agency performance measures for New York City, organized by operational goal. The data components appear in the Dynamic Mayor's Management Report (DMMR), the Mayor's Management Report (MMR), and the Preliminary Mayor's Management Report (PMMR). The dataset is published by the City of New York and was last updated on March 15, 2026.
DYCD-funded service categories and their locations across New York City. The dataset lists program areas, provider organizations, and specific sites where services are delivered. It is provided by the City of New York and was last updated on March 15, 2026.
MM-Lifelong provides 181.1 hours of video footage across three domains for multimodal lifelong understanding, released by CG-Bench in 2026. The dataset includes 1,289 questions and 1,810 clue intervals designed to test reasoning over extended temporal spans. It specifically targets long-context video comprehension with questions requiring up to 10+ hours of dependency reasoning.
Monthly computer login counts for patrons at 14 library branches in East Baton Rouge Parish. The dataset is organized by branch, year, and month, and is provided by the City of Baton Rouge. It was last updated on March 8, 2026.
SignMatic ASL Keypoint Dataset contains data for 50 words and an idle state. It is hosted on Kaggle and appears to be designed for American Sign Language (ASL) recognition tasks. The dataset likely contains keypoint coordinates extracted from video or image data.
A dataset containing the pre-trained weights for the EfficientNet-B0 convolutional neural network architecture. The model is likely intended for image classification tasks. It was published on Kaggle, but details on the original training data, author, and update date are unavailable.
Pigment and algal group concentration data from the Antarctic BROKE expedition, collected in 1996. The dataset includes marker pigment concentrations and Chlorophyll a allocations to eight algal groups derived from CHEMTAX analysis. It was provided by the Australian Antarctic Data Centre (AU_AADC) via NASA's Earthdata platform.
Monthly emissions data for 1990 provides inventories of isoprene, terpenes, and other reactive volatile organic compounds on a one-degree latitude by longitude grid. The dataset was compiled by the Global Emissions Inventory Activity (GEIA) under the International Global Atmospheric Chemistry project. Each file contains up to 64,800 data points representing grid cells.
National Meteorological Center gridded data for both Northern and Southern Hemispheres. The dataset contains 145 x 37 global grids for all atmospheric levels, stored in a binary packed format. It was produced by NOAA NCEI and last updated in April 1997.
1986 to 1992 data from the First ISCCP Regional Experiments, designed to improve cloud and radiation models. It contains processed concentration data for cirrus cloud particles based on habit type and area ratio, collected by the NCAR Kingair aircraft using PMS 2D-C and 2D-P probes. The dataset was produced by the LARC_ASDC organization.
Multi Class YOLO Split Dataset is a computer vision dataset published on Kaggle. The dataset's title suggests it is formatted for the YOLO object detection framework and contains multiple object classes. Metadata is minimal; actual content, scale, and creation details require verification after download.