DataSalon

Mscoco 2014 5K Test Image Text Retrieval | DataSalon

Multimodal & LLM

Name: Mscoco 2014 5K Test Image Text Retrieval
Creator: nlphuji
Published: 2023-01-12T14:37:24
Keywords: size_categories:1K<n<10K, arxiv:1405.0312, modality:text, library:mlcroissant, modality:image, library:datasets, region:us

Description

5,000 test images from the MSCOCO 2014 collection paired with human-annotated captions for image-text retrieval tasks. The data follows the Karpathy split, a standard benchmark for evaluating cross-modal alignment between visual features and natural language descriptions.

Use Cases

Calculate Recall@K metrics for image-to-text retrieval by ranking caption strings against image embeddings.
Benchmark text-to-image retrieval systems using the 5,000 images as a search corpus.
Validate the performance of image captioning models by comparing generated text to the ground-truth human annotations.
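
The Recall@K use cases above can be sketched as follows. This is a minimal illustration, not the dataset's official evaluation script: the helper name recall_at_k, the toy 2-dimensional embeddings, and the caption_to_image mapping are all hypothetical, and dot-product similarity is assumed as the ranking score. A query counts as a hit at K if any of its ground-truth matches appears in the top K results, which is the usual convention when each image has several captions.

```python
from typing import Dict, List, Sequence, Set


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Dot-product similarity between two embedding vectors."""
    return sum(x * y for x, y in zip(a, b))


def recall_at_k(
    query_vecs: List[Sequence[float]],
    corpus_vecs: List[Sequence[float]],
    relevant: List[Set[int]],
    ks: Sequence[int] = (1, 5, 10),
) -> Dict[int, float]:
    """Fraction of queries with at least one relevant corpus item in the top K.

    relevant[i] holds the corpus indices that are correct for query i.
    """
    hits = {k: 0 for k in ks}
    for qi, q in enumerate(query_vecs):
        scores = [dot(q, c) for c in corpus_vecs]
        # Rank corpus indices from most to least similar.
        ranked = sorted(range(len(corpus_vecs)), key=lambda j: scores[j], reverse=True)
        for k in ks:
            if any(j in relevant[qi] for j in ranked[:k]):
                hits[k] += 1
    return {k: hits[k] / len(query_vecs) for k in ks}


# Toy data (hypothetical): 2 images, 4 captions, 2 captions per image.
image_vecs = [[1.0, 0.0], [0.0, 1.0]]
caption_vecs = [[0.9, 0.1], [0.2, 0.8], [0.1, 0.9], [0.3, 0.7]]
caption_to_image = [0, 0, 1, 1]  # caption index -> index of the image it describes

# Image-to-text: each image query is matched by all of its captions.
i2t_relevant = [
    {c for c, img in enumerate(caption_to_image) if img == i}
    for i in range(len(image_vecs))
]
# Text-to-image: each caption query is matched by exactly one image.
t2i_relevant = [{img} for img in caption_to_image]

i2t = recall_at_k(image_vecs, caption_vecs, i2t_relevant, ks=(1, 2))
t2i = recall_at_k(caption_vecs, image_vecs, t2i_relevant, ks=(1, 2))
print("image-to-text:", i2t)
print("text-to-image:", t2i)
```

With real data, the toy vectors would be replaced by embeddings from the model under evaluation, the image corpus would hold the 5,000 test images, and K is commonly set to 1, 5, and 10.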

Strengths

5,000 unique images sourced from the MSCOCO 2014 validation set.
Includes multiple natural language captions per image for many-to-many retrieval evaluation.
Formatted specifically for the Karpathy split as defined in the Stanford deepimagesent repository.

Quality Score

D37

Community

1.8K downloads

11 likes

0 views

Dataset Info

Author: nlphuji
Created: Jan 12, 2023
Updated: Jan 18, 2023
