Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,993 datasets
A dataset for face anti-spoofing detection, likely containing images or video frames. It is published on Kaggle, but the author, organization, and creation date are unknown. The dataset's size, format, and specific annotation schema are not detailed in the available metadata.
A dataset likely containing images for training or analyzing Convolutional Neural Network models. It is hosted on Kaggle, but its specific content, size, and origin are not detailed in the available metadata. The dataset's creation date, author, and exact scope require verification after download.
Kaggle hosts this dataset titled 'Yolo_GCV2'. The title suggests it is likely related to object detection, potentially for training or evaluating YOLO (You Only Look Once) models. The dataset's specific content, size, and origin are not detailed in the provided metadata.
A 10-class dataset of date fruit images. It includes pre-trained PyTorch ResNet-50 model checkpoints and source code for classification tasks. The dataset was sourced from Kaggle, but specific details about its origin, size, and collection date are not provided.
An image dataset annotated for object detection of military and aerial targets. The dataset is designed for training YOLOv8 models to identify objects such as F-16 fighter jets, helicopters, drones, and rockets. The author, organization, and specific scale of the dataset are unknown.
Justin Johnson's Caffe-converted VGG is a dataset of pre-trained model weights for the VGG convolutional neural network architecture. The weights have been converted for use with the Caffe deep learning framework. It is hosted on the Kaggle platform.
A dataset of battery surface images annotated for object detection, likely for quality control in manufacturing. It is hosted on Kaggle and appears to be formatted for use with the YOLO (You Only Look Once) object detection framework. The specific number of images, annotation details, and creation date are unknown from the provided metadata.
Part 5 of a series, this dataset contains preprocessed medical images related to cancer. It is hosted on Kaggle, but the specific imaging modality, number of samples, and collection details are not provided in the metadata. The preprocessing steps applied to the images are also unspecified.
Results from the California Environmental Data Exchange Network (CEDEN) provide tissue analysis data for individual and composite aquatic organism samples. The dataset includes provisional data quality indicators to assist with metadata interpretation. Data is split into annual resources due to file size constraints.
Rex-Omni is a 3-billion-parameter Multimodal Large Language Model that frames object detection and other visual perception tasks as a next-token prediction problem. The model was authored by qq-2 and its AWQ quantized version was released on October 31, 2025. The dataset page was last updated on March 4, 2026.
135 pregnant women with sickle cell disease were studied in a cohort. The dataset contains quantitative data on support systems, adherence to antenatal care visit schedules, and fetal pregnancy outcomes. It was authored by Jackline Akello and last updated on 2026-04 13.
100,000 images across 200 classes, with 500 training, 50 validation, and 50 test images per class. The dataset consists of 64x64 colored images and was created by Jiayu Wu, Qixiang Zhang, and Guoxi Xu for a Stanford CS231n project. It is licensed under DbCL v1.0.
IRS Form 990 financial extracts, functional expenses, and executive salaries for nonprofit healthcare organizations in the United States. The dataset is hosted on Kaggle, but its author, organization, and last update date are unknown. The specific number of rows, file formats, and license details are also unspecified.
StoryMaps from the Department of the Interior, last updated March 2026, explore the history of racial integration, segregation, and civil rights at public golf courses in the District of Columbia. The reference contains links to multiple interactive StoryMaps, including 'Golf and Civil Rights in Washington, DC' and 'The Democracy of Golf'. The data is provided via an API from the datagov platform.
ESRI shape files of National Park Service tract and boundary data created by the Bureau of Land Management's GCDB for the Midwest Regional Office. The data shows properties owned by the NPS and those where it holds interests like scenic easements or rights of way. It was last updated on March 4, 2026.
Ego-1K is a collection of 956 short egocentric videos captured by Meta using a custom 12-camera synchronous rig for 3D video synthesis research. The dataset contains 491,000 frames and 5.9 million images documenting hand motions and dynamic scenes. Each video sequence lasts between 6.7 and 9.7 seconds, providing high-density multiview coverage from a VR headset perspective.
HYDRA-SR ImageNet SR Dataset Chunk 001 is a dataset for super-resolution tasks, likely derived from the ImageNet collection. It is published on Kaggle, but specific details about its size, creation method, and update history are not provided in the available metadata. The dataset appears to be one part of a larger collection, as indicated by the 'Chunk 001' designation.
A chunk of a dataset for image super-resolution tasks, likely derived from the ImageNet collection. Published on Kaggle, its specific size, creation date, and author are unknown. The dataset appears to contain images processed for resolution enhancement.
HYDRA-SR ImageNet SR Dataset Chunk 003 is a dataset of images for super-resolution tasks, likely derived from the ImageNet collection. It is published on Kaggle, but the author, organization, and creation date are unknown. The dataset's size, specific contents, and file formats are not detailed in the available metadata.
HYDRA-SR ImageNet SR Dataset Chunk 004 is a dataset for image super-resolution tasks, published on Kaggle. The dataset's title suggests it is part of a larger collection derived from the ImageNet benchmark, likely containing images processed for resolution enhancement. Metadata is minimal; specifics on size, format, and annotations require verification after download.