Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,952 datasets
NCEI Accession 0143398 contains discrete sample and profile data collected from the KNORR research vessel during the GEOSECS Atlantic cruise from July 1972 to April 1973. The data includes measurements of dissolved inorganic carbon, alkalinity, temperature, salinity, dissolved oxygen, nutrients, and radioactive tracers across the North Atlantic Ocean, North Greenland Sea, Norwegian Sea, South Atlantic Ocean, and Southern Oceans. These data were collected by researchers from Columbia University's Lamont-Doherty Earth Observatory and Princeton University.
Surface underway observations from SHIRASE in the Bali Sea, Celebes Sea, Indian Ocean, Java Sea, Makassar Strait, Philippine Sea, South Pacific Ocean, Southern Oceans, and Tasman Sea from 1992-11-15 to 1993-03-20. The data include dissolved inorganic carbon, alkalinity, salinity, sea surface temperature, air temperature, barometric pressure, wind direction, and wind speed. These data were collected by researchers from Hokkaido University and the Meteorological Research Institute as part of the MRI Shirase I dataset.
The North Pacific Ocean and Philippine Sea are the geographic scope for this dataset. It contains chemical, meteorological, and physical surface underway observations collected from the RYOFU MARU vessel between October 1997 and July 1999. Hisayuki Y. Inoue of Hokkaido University and Masao Ishii of the Meteorological Research Institute collected the data, which includes dissolved inorganic carbon, alkalinity, salinity, and sea surface temperature.
NCEI Accession 0000193 includes dissolved inorganic carbon, alkalinity, pCO2, pH, temperature, salinity, dissolved oxygen, nutrients, and carbon-13 measurements collected from discrete samples and profiles during NOAA Ship Discoverer cruise EQ92_FALL in the Equatorial Pacific Ocean from 1992-09-06 to 1992-12-08. The data were collected by Rik Wanninkhof of NOAA's Atlantic Oceanographic and Meteorological Laboratory and Richard A. Feely of NOAA's Pacific Marine Environmental Laboratory using instruments including alkalinity titrators, CO2 gas analyzers, coulometers, CTDs, and Niskin bottles.
The Indian Ocean cruise WOCE_S04I collected discrete profile measurements of dissolved inorganic carbon, total alkalinity, pCO2, temperature, salinity, dissolved oxygen, nutrients, chlorofluorocarbons, helium, and carbon isotopes. These data were collected by researchers from Columbia University and the Rosenstiel School of Marine and Atmospheric Science as part of the World Ocean Circulation Experiment between May and July 1996. The final WOCE collection covers approximately 23,000 stations from 94 cruises conducted between 1990 and 1998.
A cleaned corpus of English-language texts published before the year 1900, intended for training the GPT-1900 model. The dataset includes full document text, publication year, title, source identifier, and OCR quality scores. It was created by author 'mhla' and last updated on March 29, -2026.
586 discrete chemical measurements were collected during the Discovery 200 research cruise in February and March 1993. The dataset includes concentrations of dimethyl sulphide (DMS), dimethylsulphoniopropionate (DMSP), tetrachloromethane, carbon dioxide, and methyl iodide from the Indian and Southern Ocean. Measurements were made by scientists including Brian King, J. Robertson, Thomas Haine, and Sue Turner as part of the World Ocean Circulation Experiment.
FraunhoferIOSB developed this collection of 105,500 synthetic images covering 211 German traffic sign classes. It includes rare signs from the 2020 German traffic regulations and provides a synthetic counterpart to the GTSRB benchmark.
Gas adsorption and crystallographic data for single-crystal two-dimensional covalent organic frameworks (2D COFs), specifically related to high-capacity methane storage. The dataset, authored by Baoqiu Yu, contains 414,879 bytes of data in XLSX format and is licensed for open use. It was last updated on March 25, 2026.
A directory of publisher users for the Colombian government's open data portal, managed by datos.gov.co. It lists entities and organizations authorized to publish data, along with their user status and contact information. The dataset was last updated in March 2026.
Canada's Public Service Commission manages Federal Student Work Experience Program (FSWEP) referral requests from federal hiring departments. The data covers student applications and referrals for recruitment across federal organizations. Row and column counts are not specified in the input.
Discrete oceanographic measurements from the Iceland Sea (LN6) time series cover a 28-year period from 1985 to 2013. The data includes partial pressure of carbon dioxide, dissolved inorganic carbon, temperature, salinity, nutrients, and dissolved oxygen. Observations were collected from profile and discrete samples during cruises of the R/Vs Arni Fridriksson and Bjarni Saemundsson.
18 sampling stations were occupied during the R/V Pelican cruise PE14-10b from October 17-20, 2013. The dataset contains Acoustic Doppler Current Profiler (ADCP), Conductivity-Temperature-Depth (CTD), Marks, and cruise track data, collected by NOAA NCEI to study hard banks and deep waters west of the Mississippi River. It covers an area from offshore Alabama to west of the Mississippi River delta.
A dataset titled 'soda-a-yolo' is hosted on Kaggle. The dataset's title suggests it is intended for object detection tasks, likely using the YOLO (You Only Look Once) framework. Metadata such as column descriptions, sample data, and size are unavailable, limiting pre-download assessment.
A training dataset for optical character recognition (OCR) tasks, published on Kaggle. The dataset's specific content, size, and origin are not detailed in the provided metadata. Its intended use is likely for developing or benchmarking machine learning models that convert images of text into machine-encoded text.
RealWaste Image Segmentation is a dataset hosted on Kaggle. The dataset likely contains images of waste items with pixel-level segmentation masks. The author, organization, and specific details about the data volume and collection method are not provided.
Discrete profile measurements of 15+ oceanographic variables, including dissolved inorganic carbon, chlorofluorocarbons, and stable isotopes, were collected during a 2013 NOAA research cruise. The data supports the International CLIVAR Global Ocean Carbon and Repeat Hydrography Program's mission to quantify changes in ocean heat, freshwater, and CO2 storage. Measurements were taken from the NOAA Ship Ronald H. Brown in the Atlantic Ocean between August and October 2013.
From January 5 to February 9, 2018, the R/V Marion-Dufresne collected surface underway measurements during the OISO-28 cruise in the Indian Ocean. The dataset includes dissolved inorganic carbon, total alkalinity, temperature, and salinity, and is part of the long-running OISO program initiated in 1998. These measurements contribute to international carbon synthesis efforts like SOCAT and GLODAP.
The OISO-19 cruise dataset provides surface underway measurements of dissolved inorganic carbon, total alkalinity, temperature, and salinity from the R/V Marion-Dufresne in the Indian Ocean from January to February 2011. This data is part of the long-term OISO program, initiated in 1998, which monitors carbon dioxide and related parameters in the South-Western Indian and Southern Oceans. The measurements are regularly incorporated into international synthesis projects like SOCAT and GLODAP.
Off the east coast of Florida, this dataset contains bottle measurements of dissolved inorganic carbon and total alkalinity. It was collected during the Ocean Acidification Cruise RB1201 from NOAA Ship Ronald H. Brown in support of the NOAA Ocean Acidification Program and Climate Program Office.