DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Computer Vision Datasets | DataSalon

All Categories

👁️

Computer Vision

Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding

16,012 datasets

Computer Vision

Recaptioned Acheron: Honkai Star Rail Images for AI Training

Acheron recaptioned image dataset is intended for training Stable Diffusion or FLUX LoRA models. The dataset contains images from the video game Honkai: Star Rail. The specific volume, source, and update details are unknown.

ImageTextStable DiffusionComputer VisionImage CaptioningAi TrainingFlux Lora+1

0 views

Computer Vision

Recaptioned Yae Miko Images for Stable Diffusion and FLUX LoRA Training

Yae Miko recaptioned image dataset for Stable Diffusion and FLUX LoRA training. The dataset is hosted on Kaggle. The author, organization, and last update date are unknown.

ImageGenshin ImpactStable DiffusionComputer VisionAi TrainingCharacter ImagesFlux Lora+1

0 views

Computer Vision

Recaptioned Lumine Genshin Impact Images for AI Training

An image dataset featuring the character Lumine from the video game Genshin Impact. The images have been recaptioned for use in training Stable Diffusion or FLUX Low-Rank Adaptation (LoRA) models. The dataset was published on Kaggle, but its author, size, and update date are unknown.

ImageGenshin ImpactLora TrainingStable DiffusionComputer Vision+1

0 views

Computer Vision

MSE-Bench: A Benchmark for Multi-turn Session Image Editing

MSE-Bench consists of 100 test instances designed to evaluate multi-turn image editing systems under realistic workflows. It was created by leigangqu and hosted on Hugging Face, with a last recorded update on 2026-03-19. The benchmark provides a source image and a series of editing instructions for models to apply cumulatively.

MultimodalOPTIMIZED-PARQUETParquetLibrarypolarsArxiv250610941Size Categoriesn1 KModalitytextLibrarymlcroissantEvaluationModalityimageLibrarydatasetsBenchmarkLibrarypandasComputer VisionRegionusLicenseapache 20+1

0 views

Computer Vision

TrOCR Beam Reranker Validation Output

Cached outputs from a TrOCR model's beam search process, intended for validating a reranker component. The dataset appears to be a technical artifact from a machine learning pipeline, shared on Kaggle. Specific details on its creation date, author, and size are not provided.

TabularMachine LearningValidationOCRBeam Search+1

0 views

Computer Vision

Photo Albums with Categorized Images for Computer Vision

A collection of photo albums with categorized photos intended for image classification tasks. The dataset is hosted on Kaggle, but its author, size, and specific contents are unspecified. The last update date and license information are also unknown.

ImageCategorized ImagesPhoto AlbumsComputer Vision+1

0 views

Computer Vision

YOLO Weights for Object Detection Models

A collection of model weights for YOLO (You Only Look Once) object detection architectures, published on Kaggle. The specific version, training data, and performance metrics are not detailed in the provided metadata. Users must download the files to verify the exact model variants and their intended applications.

ImageComputer VisionObject DetectionModel Weights+1

0 views

Computer Vision

Datasetpisang Hsv Cnn: Image Data for Computer Vision

Datasetpisang_hsv_cnn is an image dataset published on Kaggle. The title suggests it contains images for training convolutional neural networks, possibly utilizing HSV color space features. No further metadata on size, origin, or specific content is available.

ImageComputer VisionImage ClassificationHsv Color SpaceCnn+1

0 views

Computer Vision

Weapon Detection Dataset with 10,000 Staged Images from CCTV and Internet Sources

10,000 high-quality images of staged individuals with visible weapons, sourced from public CCTV footage and the internet. The dataset features guns, pistols, and other weapons, designed for training detection models. It was created by UniDataPro and last updated on 2026-02-23.

ImageSmall ObjectsVideo SurveillanceComputer VisionObject DetectionWeapon Detection+1

0 views

Computer Vision

Weekly U.S. Avocado Retail Sales and Prices from the Hass Avocado Board

Weekly 2018 retail scan data for Hass avocado volume and price across multiple U.S. regions, compiled by the Hass Avocado Board. The data, downloaded in May 2018, reflects sales from grocery, mass, club, drug, dollar, and military outlets. It includes average price per avocado, sales volume by product code, and distinguishes between conventional and organic types.

TabularTime SeriesRetail SalesAvocado PricesAgricultural Economics+1

0 views

Computer Vision

Shenhe Character Images with Recaptions for AI Model Training

Kaggle hosts a dataset of images featuring the character Shenhe from the video game Genshin Impact. The images have been recaptioned specifically for training Stable Diffusion or FLUX Low-Rank Adaptation models. The dataset's size, license, and author are unspecified.

ImageTextGenshin ImpactCharacter TrainingStable DiffusionComputer VisionImage Captioning+1

0 views

Computer Vision

Recaptioned June Tag Main Shadow Fight: Image Dataset from a Video Game

A recaptioned image dataset sourced from the video game Shadow Fight. The dataset appears to be a modified version of an existing collection, likely involving new textual annotations for the images. Its author, organization, and specific scale are unknown.

ImageRecaptionedShadow FightComputer VisionVideo Game+1

0 views

Computer Vision

Recaptioned June Tag Nonmain 2.5D Shadow Fight Images

June Nonmain 2.5D recaptioned image dataset from Shadow Fight. The dataset appears to consist of images from the Shadow Fight game series. Its specific size, origin, and update history are not detailed in the provided metadata.

ImageRecaptioned ImagesShadow FightComputer VisionGame Assets2 5d+1

0 views

Computer Vision

Recaptioned June Tag Nonmain Shadow Fight: Video Game Image Dataset

Recaptioned images from the video game Shadow Fight, focusing on the June Tag Nonmain subset. The dataset likely contains visual assets from the game with new textual descriptions. Its origin and size are unspecified.

ImageShadow FightComputer VisionImage CaptioningVideo Game+1

0 views

Computer Vision

YOLO-11: Object Detection Dataset

YOLO-11 is a dataset published on Kaggle. The title suggests it is related to the YOLO (You Only Look Once) family of object detection models. The dataset's specific content, size, and origin are not detailed in the provided metadata.

ImageYoloComputer VisionObject Detection+1

0 views

Computer Vision

YOLO-26: Object Detection Dataset

YOLO-26 is a dataset published on Kaggle. Its specific contents and scale are not described in the available metadata. The dataset's author, organization, and last update date are unknown.

ImageYoloComputer VisionObject Detection+1

0 views

Computer Vision

Arabic Handwritten OCR Evaluation Dataset with 600 Paragraph Images

600 Arabic paragraph images are paired with ground truth text for evaluating optical character recognition models. The dataset is structured into training, validation, and test splits, with models evaluated using Character Error Rate (CER) and Word Error Rate (WER). It was uploaded by sedra-hugface and last updated on March 17, -2026.

ImageTextHandwritten TextEvaluationBenchmarkOptical Character RecognitionComputer VisionArabic Ocr+1

0 views

Computer Vision

YOLO Runs Results: Object Detection Model Performance

YOLO Runs Results is a dataset published on Kaggle. It likely contains performance metrics and outputs from training or inference runs of YOLO (You Only Look Once) object detection models. The specific contents, scale, and origin of the data are not detailed in the available metadata.

TabularYoloModel EvaluationComputer VisionObject Detection+1

0 views

Computer Vision

Personix-Octo: 8,355 Multi-Theme Images for Classification

Personix-Octo is a multi-theme image classification dataset containing 8,355 JPEG images across 9 distinct visual categories. Created by Poralus and updated in March 2026, the collection provides approximately 850 images per theme for benchmarking computer vision models. The dataset supports high-resolution workflows, including 4K resolution options.

Size Categories1 Kn10 KLanguageenMulti ThemePersonix OctoTask Categoriesimage ClassificationComputer VisionRegionusImage ClassificationJpegLicenseapache 20+1

0 views

Computer Vision

Bangladeshi Weed Images for 50 Plant Species

Bangladeshi Weed Image Dataset contains 9,190 images across 50 weed species. The dataset is hosted on Kaggle, but the author, organization, and license details are unknown. The last update date and specific collection methodology are also not provided.

ImagePlant SpeciesComputer VisionAgricultureWeed IdentificationBangladesh+1

0 views

PreviousPage 410 of 798Next