Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,926 datasets
yolo_model is a dataset published on Kaggle. The title suggests it is related to the YOLO (You Only Look Once) family of object detection models. The dataset likely contains images and annotations for training or evaluating such models, but specific details on size, source, and content are unavailable.
1977 to present data on municipal wastewater treatment plants discharging into marine waters under the Clean Water Act Section 301(h) waiver program. The dataset, managed by SCIOPS and sourced from NASA EarthData, contains monitoring information for facilities granted modified permits waiving secondary treatment requirements.
Program sites funded by the New York City Department of Youth and Community Development (DYCD). The dataset lists service categories, providers, and site locations across the city. It was published by data.cityofnewyork.us and was last updated in March 2026.
Foreground segmentation masks generated by SEEM have been added to standard datasets used for CLIP-based prompt tuning research like CoOp. The masks use RGB values of [255, 255, 255] for foreground and [0, 0, 0] for background, with the shorter image side fixed to 512 pixels. This dataset was created by author JREion and last updated on Hugging Face in April 2026.
A multilateral treaty establishing a standardized classification system for goods and services used in trademark registration. The dataset is an archived publication from Global Affairs Canada, last updated in February 2026. It is presented as a PDF document for research and recordkeeping purposes.
One bilateral taxation convention between Canada and Algeria governs the avoidance of double taxation and prevention of fiscal evasion for income and capital. The archived document, including a protocol, was published by Global Affairs Canada and is not subject to current web standards.
A dataset of New York City agencies, offices, and other organizations with NYC-specific governance functions. The data is provided by the City of New York and was last updated on March 22, 2026. It includes advisory or regulatory organizations, public benefit or development organizations, elected offices, mayoral agencies and offices, nonprofit organizations, and state agencies.
Full-page manuscript PDFs and DOCX transcriptions from the Omar Al-Saleh memoir collection, covering the period from 1951 to 1965. The data was created by U4RASD as part of the NAKBA NLP 2026: Arabic Manuscript Understanding Shared Task and was last updated on April 8, 2026.
The Uganda - Subnational Displacement Forecasts dataset provides three-month settlement-level projections for South Sudanese refugees and asylum seekers in Uganda. It is based on the AHEAD model and contains both observed values from official sources and forecasted model estimates. The dataset was last updated on 2026-03-16 and is published by the Danish Refugee Council under a CC-BY-4.0 license.
East Baton Rouge Parish public libraries provide monthly WiFi usage statistics from 2023 onward. The data is organized by branch, year, and month, counting only patron connections to public-facing networks. The City of Baton Rouge published this dataset, which was last updated in March 2026.
A dataset titled 'plantvillage_yolo11' is hosted on Kaggle. The title suggests it contains images formatted for use with the YOLO (You Only Look Once) object detection model, version 11. The specific subject is likely related to the PlantVillage project, which typically involves images of plant leaves for disease classification.
A dataset published on Kaggle for object detection tasks. The title suggests it likely contains images annotated for detecting Light-Emitting Diodes (LEDs). Metadata is minimal; the specific number of images, annotation format, and collection details are unknown.
Soda-D-YOLO2 is a dataset hosted on Kaggle. Its title suggests a focus on object detection, likely for training or benchmarking YOLO (You Only Look Once) models. The dataset's specific content, size, and origin are not detailed in the available metadata.
Spiking ResNet 18 is a dataset published on Kaggle. The title suggests it relates to a spiking neural network version of the ResNet-18 architecture, likely for computer vision tasks. The dataset's specific content, scale, and origin are not detailed in the provided metadata.
A dataset for object detection tasks, likely containing images of various vegetables. It is formatted for use with the YOLO (You Only Look Once) object detection framework. The dataset is hosted on the Kaggle platform, but specific details on its size, creation date, and authorship are not provided in the available metadata.
A collection of document images likely containing invoices and receipts intended for Optical Character Recognition (OCR) tasks. The dataset is hosted on Kaggle, but its specific scale, creation details, and update history are not provided in the available metadata. The content and structure must be verified after download.
Seafloor sediment samples were collected from eight sites along the Victoria Land coast to investigate relationships between sediment texture, microorganisms, water depth, and currents. The dataset includes samples for grain size analysis, foraminiferal work, and water samples for oxygen and carbon isotope analysis, collected in 1987. This data from SCIOPS may help interpret sea level changes in cored sequences by comparing modern shoreline conditions with fossil records.
This dataset reports water quality variables for surface water from a permanent sampling site on Robson Creek in Far North Queensland. It includes measurements of major cations and anions, trace heavy metals, solids, and inorganic and organic nitrogen and phosphorus from 2013.
March 1998 data on arsenic species production in the Subantarctic Zone of the Southern Ocean. Surface and vertical profile samples were collected along a meridional transect south of Australia, from 42°S to 55°S. The dataset was produced by the organization AU_AADC and is hosted on NASA's EarthData platform.
80,000 oceanographic stations in the Atlantic from 1900-1991 provide vertical profiles of temperature and salinity. Data includes 65,000 Black Sea stations with hydrochemical and meteorological observations from 1910-1992, plus surface station data from coastal Guinea. The dataset was compiled by the Ukrainian Academy of Science's MHI and other sources, with the latest records from 1992.