Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
16,012 datasets
The Acheron recaptioned image dataset is intended for training Stable Diffusion or FLUX LoRA models. The dataset contains images from the video game Honkai: Star Rail. The specific volume, source, and update details are unknown.
Yae Miko recaptioned image dataset for Stable Diffusion and FLUX LoRA training. The dataset is hosted on Kaggle. The author, organization, and last update date are unknown.
An image dataset featuring the character Lumine from the video game Genshin Impact. The images have been recaptioned for use in training Stable Diffusion or FLUX Low-Rank Adaptation (LoRA) models. The dataset was published on Kaggle, but its author, size, and update date are unknown.
MSE-Bench consists of 100 test instances designed to evaluate multi-turn image editing systems under realistic workflows. It was created by leigangqu and hosted on Hugging Face, with a last recorded update on 2026-03-19. The benchmark provides a source image and a series of editing instructions for models to apply cumulatively.
Cached outputs from a TrOCR model's beam search process, intended for validating a reranker component. The dataset appears to be a technical artifact from a machine learning pipeline, shared on Kaggle. Specific details on its creation date, author, and size are not provided.
A collection of photo albums with categorized photos intended for image classification tasks. The dataset is hosted on Kaggle, but its author, size, and specific contents are unspecified. The last update date and license information are also unknown.
A collection of model weights for YOLO (You Only Look Once) object detection architectures, published on Kaggle. The specific version, training data, and performance metrics are not detailed in the provided metadata. Users must download the files to verify the exact model variants and their intended applications.
Datasetpisang_hsv_cnn is an image dataset published on Kaggle. The title suggests it contains images for training convolutional neural networks, possibly utilizing HSV color space features. No further metadata on size, origin, or specific content is available.
10,000 high-quality images of staged individuals with visible weapons, sourced from public CCTV footage and the internet. The dataset features guns, pistols, and other weapons, designed for training detection models. It was created by UniDataPro and last updated on 2026-02-23.
Weekly 2018 retail scan data for Hass avocado volume and price across multiple U.S. regions, compiled by the Hass Avocado Board. The data, downloaded in May 2018, reflects sales from grocery, mass, club, drug, dollar, and military outlets. It includes average price per avocado, sales volume by product code, and distinguishes between conventional and organic types.
Kaggle hosts a dataset of images featuring the character Shenhe from the video game Genshin Impact. The images have been recaptioned specifically for training Stable Diffusion or FLUX Low-Rank Adaptation models. The dataset's size, license, and author are unspecified.
A recaptioned image dataset sourced from the video game Shadow Fight. The dataset appears to be a modified version of an existing collection, likely involving new textual annotations for the images. Its author, organization, and specific scale are unknown.
June Nonmain 2.5D is a recaptioned image dataset that appears to consist of images from the Shadow Fight game series. Its specific size, origin, and update history are not detailed in the provided metadata.
Recaptioned images from the video game Shadow Fight, focusing on the June Tag Nonmain subset. The dataset likely contains visual assets from the game with new textual descriptions. Its origin and size are unspecified.
YOLO-11 is a dataset published on Kaggle. The title suggests it is related to the YOLO (You Only Look Once) family of object detection models. The dataset's specific content, size, and origin are not detailed in the provided metadata.
YOLO-26 is a dataset published on Kaggle. Its specific contents and scale are not described in the available metadata. The dataset's author, organization, and last update date are unknown.
600 Arabic paragraph images are paired with ground truth text for evaluating optical character recognition models. The dataset is structured into training, validation, and test splits, with models evaluated using Character Error Rate (CER) and Word Error Rate (WER). It was uploaded by sedra-hugface and last updated on March 17, 2026.
YOLO Runs Results is a dataset published on Kaggle. It likely contains performance metrics and outputs from training or inference runs of YOLO (You Only Look Once) object detection models. The specific contents, scale, and origin of the data are not detailed in the available metadata.
Personix-Octo is a multi-theme image classification dataset containing 8,355 JPEG images across 9 distinct visual categories. Created by Poralus and updated in March 2026, the collection provides approximately 850 images per theme for benchmarking computer vision models. The dataset supports high-resolution workflows, including 4K resolution options.
Bangladeshi Weed Image Dataset contains 9,190 images across 50 weed species. The dataset is hosted on Kaggle, but the author, organization, and license details are unknown. The last update date and specific collection methodology are also not provided.