Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
16,020 datasets
Ten micrometeorological sites in Antarctica's McMurdo Dry Valleys recorded temperature, humidity, wind speed, wind direction, pressure, sunlight, and heat transfer automatically every 5-15 minutes. The data was collected by SCIOPS in 1993-1994 to understand the valleys' formation and model the effects of global warming on glaciated regions. The dataset's primary aim was to validate physical models for predicting climate change impacts on similar environments.
NOAA/WDS Paleoclimatology archives a tree ring dataset from Oulanka National Park, Finland. The chronology covers 252 calendar years, from 224 to -28 years before present. The data was published by the NOAA National Centers for Environmental Information in 1978.
NOAA's Paleoclimatology archive provides a roughly 500-year time series of trace element data from an ice core drilled in the Puruogangri Ice Cap, China. The data coverage spans from 453 to -45 calendar years before present; the negative end value means the record extends past the 1950 baseline, to about 1995 CE. This dataset was published by NOAA National Centers for Environmental Information in 1995.
478 calendar years of borehole temperature data from Russia provide a direct physical record of past ground thermal conditions. The dataset contains climate reconstruction parameters from a single borehole site, Argagan-2167, curated by NOAA's National Centers for Environmental Information. This archived study was published in 1978.
Spanning 373 to -46 calendar years before present, this dataset provides a fire history reconstruction from tree-ring analysis at the China Wall site in Colorado. The data is archived by the NOAA National Centers for Environmental Information under its Paleoclimatology program. The associated study type is Fire, and the record was last updated in 1996.
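Several of the paleoclimate entries above give their coverage in calendar years before present (BP), where negative values fall after the baseline year. The following is a minimal sketch of the conversion to calendar years, assuming the conventional 1950 baseline mentioned in the Puruogangri entry. The function names and the `records` dictionary are hypothetical and only restate the ranges quoted in the descriptions; they are not part of any NOAA tooling.

```python
# Assumption: "before present" (BP) is counted back from 1950 CE.
BP_BASELINE_YEAR = 1950


def bp_to_ce(years_bp: int) -> int:
    """Convert years BP to a calendar year (CE). Negative BP is after 1950."""
    return BP_BASELINE_YEAR - years_bp


def span_years(oldest_bp: int, youngest_bp: int) -> int:
    """Length of a record given its oldest and youngest BP bounds."""
    return oldest_bp - youngest_bp


# Hypothetical lookup built from the BP ranges quoted in the entries above.
records = {
    "Oulanka National Park tree rings": (224, -28),
    "Puruogangri Ice Cap ice core": (453, -45),
    "China Wall fire history": (373, -46),
}

for name, (oldest_bp, youngest_bp) in records.items():
    start, end = bp_to_ce(oldest_bp), bp_to_ce(youngest_bp)
    length = span_years(oldest_bp, youngest_bp)
    print(f"{name}: {start}-{end} CE ({length} years)")
```

Under this assumption the Oulanka range works out to 252 years, matching the figure quoted in its description, and each negative end value maps to a calendar year after 1950 rather than to a date in the future.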
Global 1990 emissions data for non-methane volatile organic compounds from human activities. The inventory includes emissions from specific compound groups like alkanols, alkanes, alkenes, aromatics, esters, and chlorinated hydrocarbons. It was derived from the EDGAR 2.0 database by the SCIOPS organization.
FEWS NET provides weekly staple food price data for the Democratic Republic of the Congo. The dataset is published on the HDX platform and was last updated on 2026-03-03. Data is available in JSON, XLSX, and CSV formats under a CC-BY-4.0 license.
Project records detail software developments managed by a public entity, organized by year and associated with addressed technological requirements. The dataset includes information such as requesting department, project description, and involved information system. It is published by www.datos.gov.co to promote transparency and was last updated in January 2026.
CG-PKINet-Sigmoid-YOLO_Files is a dataset hosted on Kaggle. The title suggests it contains files related to a YOLO-based object detection model, possibly named CG-PKINet with a Sigmoid activation component. Its specific contents, scale, and authorship are not detailed in the available metadata.
Keypoint data likely extracted from multi-camera video sequences for the purpose of fall detection. The dataset is hosted on Kaggle, but its author, size, and specific collection details are unknown. Columns suggest it contains pose estimation coordinates, but the exact structure and volume of data require verification after download.
Optical character recognition data related to the European Union, published by the HiTZ organization. The dataset was last updated on HuggingFace on April 16, 2026. Its specific content, scale, and collection method are not detailed in the provided metadata.
Neraca Perdagangan 2016-2025 is a dataset published on Kaggle. The title suggests it contains Indonesian trade balance statistics over a ten-year period. The dataset's actual content and structure require verification after download.
CARINA/29HE19951203 cruise data contains biological, chemical, and physical profile measurements from the South Atlantic Ocean. The dataset includes discrete sample and CTD data for alkalinity, dissolved oxygen, nutrients, chlorophyll, and other variables relevant to carbon system studies. It is part of the international CARINA synthesis project for creating internally consistent ocean biogeochemical datasets.
CARINA/29HE19960117 provides 15 biogeochemical variables, including alkalinity, nutrients, and chlorophyll, collected via CTD and bottle sampling from the HESPERIDES research vessel in the South Atlantic Ocean. This cruise data is part of the CARINA synthesis project, which created an internally consistent dataset for carbon system studies across the Atlantic, Arctic, and Southern Oceans. The dataset includes measurements from January 17 to February 5, 1996.
Chemical and physical oceanographic data collected from the LOUIS S. ST. LAURENT in the Arctic Ocean, Chukchi Sea, and North Greenland Sea from 1994-07-24 to 1994-09-01. The dataset includes measurements of dissolved inorganic carbon, alkalinity, chlorofluorocarbons, nutrients, dissolved oxygen, temperature, and salinity. Data were collected by researchers from the Bedford Institute of Oceanography, Fisheries and Oceans Canada, Scripps Institution of Oceanography, and the University of Washington as part of the CARINA synthesis project.
Synthetic Invoice OCR Dataset is a collection of artificially generated invoice images. The dataset includes realistic degradation effects to simulate real-world document scanning conditions. It was sourced from the Kaggle platform, but details on its creator, size, and update history are not provided.
IBM Research's 900K Judgements dataset contains approximately 900,000 pairwise comparison judgements from multiple LLM judges evaluating model responses. The data was collected for the paper 'Mediocrity is the key for LLM as a Judge Anchor Selection' to investigate anchor selection in LLM-as-a-judge evaluation. The dataset was last updated on March 18, 2026.
A collection of photographs published on Kaggle. The dataset's specific content, size, and origin are not detailed in the available metadata. Further details about the collection methodology, authorship, and temporal coverage are unknown.
A dataset likely containing images of hand gestures for training a Convolutional Neural Network (CNN) model. It was published on Kaggle, but details about its size, creator, and last update are unknown. The specific content and annotations require verification after download.
Kaggle hosts a set of model weights for a ResNet50 neural network. The weights appear to have been pruned using an L1-norm technique with a factor of 0.3, likely to reduce model size and complexity. The author, organization, and original data source are unknown.