Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,930 datasets
Information relating to the DBS Organisation, published on the eu_open_data platform. The dataset is provided by the Government Digital Service, though its specific temporal coverage and scale are not detailed in the available metadata.
Persian Number OCR Dataset is a collection of images for optical character recognition tasks. It is hosted on Kaggle, but the author, organization, and specific collection details are not provided. The dataset's size, format, and annotation specifics are unknown from the available metadata.
License plate images annotated with line-level bounding boxes for optical character recognition tasks. The dataset is hosted on Kaggle, a platform for data science competitions and projects. Specific details regarding the number of images, collection source, and creation date are not provided in the available metadata.
January 2011 maps from the Australian Ocean Data Network illustrate manganese resources across Australia. The data is organized into two sheets detailing resources by region and by deposit type. This thematic mapping provides a snapshot of national mineral resource assessment from that period.
Phosphorus speciation data from the British Geological Survey includes measurements of reduced and polymerized phosphorus from 3.2-billion-year-old Archean rock samples in the Moodies Group, South Africa, and from laboratory heating experiments simulating magma intrusions. Auxiliary bulk elemental geochemical data is provided to characterize the rock samples. The dataset was last updated on 2026-04 09.
A dataset titled 'Profil Gizi Pangan Nusantara (TKPI)' is available on Kaggle. The title suggests it likely contains nutritional information for various foods across Indonesia. The dataset's specific content, size, and origin require verification after download.
LION Differences File documents segment and node-level changes between releases of New York City's street network data. The file is produced by the City of New York's Department of City Planning and was last updated on March 15, 2026. It enables users to migrate organizational data tied to specific street segment or intersection identifiers.
Propella-1-4b, a small multilingual language model, generated these annotations for text documents across 18 properties. The annotations are organized into six categories, including core content, quality, and safety. The dataset was created by openeurollm and last updated on March 20, 2026.
Uganda's progress toward the Millennium Development Goals is documented in this tabular dataset provided by the World Bank Group. Updated in March 2026, the collection provides standardized development indicators in CSV format for national-level analysis.
The Medicare Diabetes Prevention Program dataset lists suppliers from which eligible Medicare beneficiaries can receive services. Information likely includes organization name, location, contact details, and National Provider Identifier (NPI), and is used to populate a map of service providers. The dataset is provided by the U.S. Department of Health & Human Services via Data.gov and was last updated on March 6, 2026.
OCHA Ukraine's consolidated 3W and 5W data, detailing the operational presence of organizations involved in the humanitarian response. The dataset is licensed under CC-BY-4.0 and was last updated on 2026-03-17. It is sourced from clusters and organizations reporting to OCHA Ukraine.
An image dataset for binary classification of cats and dogs. The description indicates it was built for a deep learning project using Convolutional Neural Networks and the MobileNetV2 architecture. The dataset's size, origin, and specific collection details are not provided.
Global Affairs Canada compiled this set of multilateral agreements and protocols involving Canada and international organizations. The collection includes the World Trade Organization protocol amending the Agreement on Government Procurement, WIPO copyright treaties, the International Convention for the Protection of New Varieties of Plants, and Additional Protocols I and II to the Geneva Conventions. The information is archived and was last updated on the platform in February 2026.
Retina Image Dataset is a collection of medical images published on Kaggle. The dataset likely contains fundus photographs of the retina, which are commonly used for diagnostic purposes. Specifics regarding the number of images, annotation details, and collection methodology are not provided in the available metadata.
A set of pre-trained model weights for the ResNet50-A1 architecture, likely fine-tuned on the ImageNet-1k dataset. The weights are hosted on Kaggle and are associated with the 'timm' (PyTorch Image Models) library. Specific details regarding the training methodology, performance metrics, and exact version are not provided in the available metadata.
Scripts for applying Meta's SAM3 model to detect and segment objects in images using text prompts. The repository contains Python scripts for generating bounding boxes and pixel-level masks on datasets from HuggingFace. It was created by uv-scripts and last updated in March 2026.
Preprocessed auxiliary files for the ModuSeg project support weakly-supervised semantic segmentation. The dataset includes CorrCLIP-generated pseudo masks, VOC-style segmentation annotations for COCO2014, and augmented annotations for VOC2012 SBD, along with image-level labels in JSON format. Author QZing007 uploaded the collection to Hugging Face in April 2026.
WA Ground Ambulance Locally Set Rates provides the established and contracted rates for local governmental ground ambulance service organizations in Washington. The dataset is collected and posted by the Washington Office of the Insurance Commissioner as required by RCW 48.49.205. It includes columns for service providers, effective dates, and specific billing codes for different levels of ambulance service.
Nathalie Lefèvre of Sorbonne University collected this dataset during cruise 35A820070606 from June to July 2007. It includes discrete sample and profile observations of dissolved inorganic carbon, alkalinity, salinity, and sea surface temperature from the Gulf of Guinea and North and South Atlantic Oceans. The data were gathered as part of the International CLIVAR Global Ocean Carbon and Repeat Hydrography Program to quantify changes in ocean carbon storage and transport.
From October 1986 to March 1987, this dataset was collected for the Phase II Outer Continental Shelf Monitoring Program in the Santa Maria Basin, California. It contains physical, chemical, and biological data analyzed to determine the impact of oil and gas drilling and production. The data, submitted by Dr. Hyland and Dr. Lissner, includes measurements of temperature, salinity, hydrocarbons, trace metals, radioisotopes, sediment grain size, and water quality.