Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,891 datasets
PACIFICA (PACIFic ocean Interior CArbon) project data collected from the Wakataka Maru in the North Pacific Ocean from September 5 to 18, 2003. Tsuneo Ono of the National Research Institute of Fisheries Science collected these discrete sample and profile observations using CTD and coulometer instruments. The data include dissolved inorganic carbon, nitrate, phosphate, silicate, salinity, and water temperature.
PACIFICA (PACIFic ocean Interior CArbon) project data including alkalinity, dissolved inorganic carbon, pH, and other chemical and physical variables. The data were collected from the Hokusei Maru in the North Pacific Ocean from June 22 to July 6, 1999 using CTD, coulometer, and bottle instruments. Shuichi Watanabe of the JAMSTEC Mutsu Institute for Oceanography (MIO) collected these data as part of cruise HO99-2.
Discrete profile measurements of dissolved inorganic carbon, total alkalinity, temperature, salinity, dissolved oxygen and nutrients collected during the R/V Marion Dufresne cruise OISO-03 in the Indian Ocean from 1998-12-05 to 1998-12-27. These data were collected by Claire Lo Monaco and Nicolas Metzl of Sorbonne University as part of the International CLIVAR Global Ocean Carbon and Repeat Hydrography Program.
Discrete and profile measurements of dissolved inorganic carbon, alkalinity, temperature, salinity, and pressure collected aboard the Healy research vessel in the Arctic Ocean and Beaufort Sea from September 7 to 27, 2010. The data were collected by NOAA PMEL and University of Alaska researchers as part of the CLIVAR_HLY1003 cruise to quantify changes in ocean heat, freshwater, and carbon dioxide storage. This dataset contributes to the International CLIVAR Global Ocean Carbon and Repeat Hydrography Program.
Discrete profile measurements of dissolved inorganic carbon, alkalinity, pH, temperature, salinity, oxygen, nutrients, and dissolved organic carbon collected during the R/V Mirai cruise GO-SHIP_P10N in the North Pacific Ocean from 2014-07-09 to 2014-07-15. The dataset was contributed by the National Oceanic and Atmospheric Administration and represents observations along the WHP-P10N section, which was previously observed by Japanese agencies in 2005, 2011, and 2014.
The CLIVAR_OISO02 dataset contains discrete profile measurements of dissolved inorganic carbon, total alkalinity, temperature, salinity, dissolved oxygen, nutrients, and isotopic ratios collected during the R/V Marion Dufresne cruise OISO-02 in the Indian Ocean from August 18 to September 9, 1998. The data were collected by Claire Lo Monaco and Nicolas Metzl of Sorbonne University as part of the International CLIVAR Global Ocean Carbon and Repeat Hydrography Program. The program aims to quantify changes in the storage and transport of heat, fresh water, carbon dioxide, and related parameters.
Discrete sample and profile data collected from the Hokusei Maru research vessel in the North Pacific Ocean from 1998-06-22 to 1998-07-06. The data include dissolved inorganic carbon, pH, alkalinity, temperature, salinity, dissolved oxygen, and nutrients. These data were collected by Shuichi Watanabe of JAMSTEC as part of the PACIFICA international synthesis project.
Discrete chemical and physical oceanographic data were collected from the Hokusei Maru in the North Pacific Ocean from July 10 to July 21, 2001. The dataset includes measurements for dissolved inorganic carbon, pH, alkalinity, nutrients, dissolved oxygen, temperature, and salinity. These data were collected by Masahide Wakita of Hokkaido University as part of the international PACIFICA project for synthesizing Pacific Ocean interior carbon data.
230 receipt images are labeled with structured JSON data extracted via the Gemini model, covering a variety of merchants, formats, and receipt layouts. The dataset is intended for fine-tuning vision-language models on document OCR tasks and was uploaded by author docjay131 to Hugging Face. It was last updated on April 5, 2026.
Car Number Plate with Annotation 100 imges Dataset is a collection of 100 annotated images for training object detection models. According to its description, it is intended for number plate detection with computer vision techniques such as YOLO. The dataset was sourced from Kaggle, but its creator, license, and update history are unknown.
South America and Antarctica are covered by ground magnetometer data from the 12-station South American Meridional B-field Array (SAMBA). The data, processed and despiked in the HDZ coordinate system, begin in April 2002 and are provided at 1-second resolution for most stations. They are produced by SCIOPS and available in ASCII and CDF formats.
NOAA's National Coral Reef Monitoring Program collects carbonate chemistry data from random and fixed sites across the Atlantic basin to assess spatial and temporal variation in coral reef seawater systems. The data include parameters such as total alkalinity and dissolved inorganic carbon, analyzed by the Atlantic Oceanographic and Meteorological Laboratory. Sampling methods range from manual collection by divers to automated subsurface samplers and buoy-based calibration validation.
The CEDAR Data Base holds outputs from several large-scale atmospheric models, including the Thermosphere Ionosphere General Circulation Model and the Assimilative Mapping of Ionospheric Electrodynamics procedure. The collection includes specific runs for dates such as 22 March 1979, generic runs for winds and tides, and outputs for specific longitudinal points. These models were developed by researchers at NCAR/HAO, including Raymond Roble, Arthur Richmond, Jeffrey Forbes, and Maura Hagan.
Solar wind plasma electron and ion flux measurements from the Advanced Composition Explorer (ACE) satellite's SWEPAM instrument. The data describe solar wind conditions every minute, with parameters averaged over intervals from 64 seconds to 27 days. The dataset is produced by the SCIOPS organization.
A curated image dataset for identifying pet species using machine learning, sourced from Kaggle. The dataset's author, organization, and specific size are unknown.
A real-time JSON feed and map of unplanned road closures within the Australian Capital Territory. The dataset includes closure types such as light rail, road works, emergency, and inclement weather, with fields for start and end times as Unix timestamps and for geographic coordinates. It is maintained by the ACT Government's Transport Canberra and City Services (TCCS) directorate.
Starrydata2 is a database of experimental properties for inorganic materials, compiled by researcher Tomoya Mato. The dataset is shared as a 51.0 MB ZIP file under a CC-BY-4.0 license. It was last updated in April 2026.
Yenisei Governorate reports from historical archives form the basis for this collection of 788,436 synthetically generated images of pre-reform Russian words with corresponding text transcriptions. The dataset was created by author sherstpasha using a ScrabbleGAN generative model adapted to mimic the historical handwriting style of these reports. It was last updated on April 10, 2026.
Department of Youth and Community Development (DYCD) Contracts provides information on contracts issued by the New York City agency, including funded dollar amounts and registration details. Contract information is reported per fiscal year, with one row for each contract-fiscal year pair, and includes business units, provider organizations, dates, status, and RFP details. The dataset is published by the City of New York on the Data.gov platform and was last updated on March 15, 2026.
A large-scale OCR dataset that aggregates 14 public benchmarks for text detection and recognition. It pairs images with transcribed text, bounding boxes, and polygon coordinates for text regions. The dataset is authored by Yesianrohn and was last updated on Hugging Face in April 2026.