DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

ImageInWords: Hyper-Detailed Image Descriptions | DataSalon

Home Multimodal & LLMImageInWords: Hyper-Detailed Image Descriptions

Multimodal & LLM

ImageInWords: Hyper-Detailed Image Descriptions

Name: ImageInWords: Hyper-Detailed Image Descriptions
Creator: google
Published: 2024-03-06T03:30:17
Keywords: I2t, Image To Text, Human Annotation, Evaluation, Image Text, T2i, Detailed Descriptions, Image Captioning, Dataset Generation, Image Descriptions, Detailed Annotations

by google / google·Updated 1y ago

Available on 1 platform

Description

10,000+ hyper-detailed image descriptions and object-level annotations derived from the Open Images dataset. The data includes fine-grained attributes, spatial relationships, and dense scene narratives designed to improve vision-language model alignment.

Use Cases

Fine-tune vision-language models for dense captioning using the detailed scene description field.
Benchmark large vision-language models (LVLMs) on their ability to identify specific object attributes and spatial arrangements.
Train text-to-image models to follow complex, multi-object prompts based on the provided ground-truth descriptions.

Strengths

Contains over 10,000 images with human-refined, dense descriptions.
Includes object-level metadata such as bounding boxes and specific attribute labels for every entity mentioned.
Features a multi-stage annotation pipeline that integrates machine-generated drafts with expert human editing for factual precision.

I2t Image To Text Human Annotation Evaluation Image Text T2i Detailed Descriptions Image Captioning Dataset Generation Image Descriptions Detailed Annotations

Related Datasets

Quality Score

D24

Description

Source

Reputation

Quality Score

D24

Description

Source

Reputation

Access

Community

227 likes

0 views

Dataset Info

Author: google
Org: google
Created: Mar 6, 2024
Updated: Nov 17, 2024
Language: JavaScript
Last synced: Jun 23, 2026

Access

Community

227 likes

0 views

Dataset Info

Author: google
Org: google
Created: Mar 6, 2024
Updated: Nov 17, 2024
Language: JavaScript
Last synced: Jun 23, 2026

ImageInWords: Hyper-Detailed Image Descriptions

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info