Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,961 datasets
Lamar excavations are part of an interactive Story Map about archaeological work at Ocmulgee National Historical Park in the 1930s. The data is organized in layers for each general area and for the Stratigraphic Survey, based on original field maps and notes. It was published by the Department of the Interior and last updated on March 4, 2026.
Ocmulgee National Historical Park in the United States contains archaeological excavation data from the 1930s. The dataset is part of an interactive Story Map organized by excavation area and stratigraphic survey, based on original field maps and notes. It was published by the Department of the Interior and last updated in March 2026.
Part of an interactive Story Map about the 1930s archaeological excavations at Ocmulgee National Historical Park, organized in layers for each general area and for the Stratigraphic Survey. The dataset is based on original field maps and notes from the Department of the Interior and was last updated in March 2026.
Stratigraphic survey layers are part of an interactive Story Map about archaeological excavations at Ocmulgee National Historical Park in the 1930s. The dataset is organized in layers for each general area and is based on original field maps and notes. It was published by the Department of the Interior and last updated on 2026-03 04.
McDougal and Dunlap Mounds excavations are part of an interactive Story Map about the archaeological excavations at Ocmulgee National Historical Park in the 1930s. The data is organized in layers for each general area and for the Stratigraphic Survey, based on original field maps and notes. It was last updated on 2026-03-04 01:18:07.959143.
A binary CT scan image dataset for lung cancer classification. The dataset contains images categorized as Cancer or Normal. It was sourced from Kaggle, but the author, organization, and specific collection details are unknown.
A collection of CT scan images focused on the kidney. The dataset is hosted on Kaggle, but details on the number of images, collection methodology, and specific attributes are not provided in the metadata. The author, organization, and time range of data collection are unknown.
ResNet10_best is a pre-trained model artifact hosted on Kaggle. The title suggests it contains the best-performing weights for a ResNet-10 architecture, a convolutional neural network commonly used for image recognition tasks. Its specific application domain and training data are not detailed in the provided metadata.
Colville River Delta Landcover Data contains land cover classifications for a region in northern Alaska. The data is organized by U.S. Geological Survey quadrangles and spatially referenced to 50-meter grid cells. It was produced by the USGS using Landsat MSS data, aerial photography, and the National Wetlands Inventory.
Approximately 164,000 YouTube thumbnails paired with their corresponding video titles. The dataset was constructed by collecting public YouTube channel feeds, extracting video metadata, filtering and deduplicating entries, and downloading thumbnail images at scale. It was authored by l3afai and last updated on March 26, 2026.
City of Chicago provides a historical record of all sanitation code complaints reported to 311 since January 1, 2011. The dataset includes duplicate requests labeled in the status field and is updated daily by the Department of Streets and Sanitation.
Sidewalk Management System data tracking inspections, violations, and status for New York City sidewalks. The dataset identifies sidewalk locations by borough, block, and lot numbers and is provided by the City of New York. It was last updated on 2026-03-22.
Projects, buildings, and units reported by the New York City Department of Housing Preservation and Development (HPD) that began after January 1, 2014. The data counts towards the Housing New York plan (2014-2021) and the Housing Our Neighbors plan (2022-present). It is published by the City of New York on the datagov platform.
Sidewalk Management Database - Inspection tracks and organizes inspections and violations for New York City sidewalks. The dataset identifies locations where Department of Transportation inspectors performed sidewalk defect inspections. It is provided by the City of New York and was last updated on March 22, 2026.
Model_ResNet152v2_Global_19_6 is a pre-trained deep learning model hosted on Kaggle. The title suggests it is based on the ResNet152V2 architecture, a common choice for image recognition tasks. Its specific application and training data are not detailed in the available metadata.
Onshore Australia is covered by a gravity anomaly image derived from approximately 1.8 million gravity observations. The final product integrates 1,371,998 stations from the national database and 19,558 from a regional survey, processed with terrain corrections and displayed as a hue-saturation-intensity image. This dataset represents gravity data collected by government, industry, and academia from the 1940s to 2016.
NOAA's National Centers for Coastal Ocean Science (NCCOS) and the National Marine Sanctuary Program (NMSP) collaborated on a biogeographic assessment to support management plan revisions for the Channel Islands National Marine Sanctuary. The dataset provides information on the distribution of benthic resources, specifically substrate type, to inform policy decisions on marine zoning and boundary adjustments. The assessment is part of a series initiated for sanctuaries along the U.S. west coast.
PaddleOCR-part2-output1 is a dataset published on Kaggle. The title suggests it contains output from the PaddleOCR optical character recognition system, likely consisting of processed images and extracted text. Specific details regarding the dataset's size, origin, and creation date are not provided in the available metadata.
Japan's GENIAC Project, promoted by the Ministry of Economy, Trade and Industry and NEDO, produced this dataset. It is an evaluation dataset for the CSV-to-IDS task, used to assess the Ishigaki-IDS model and evaluate whether an LLM can generate appropriate IDS from CSV. The dataset was authored by ONESTRUCTION and last updated on March 26, 2026.
varcnn-test-idx is a dataset hosted on Kaggle. Its title suggests it may be used for testing convolutional neural network models. The dataset's specific content, size, and origin are not detailed in the available metadata.