25M Img Caps: 25 Million Image-Caption Pairs

Name: 25M Img Caps: 25 Million Image-Caption Pairs
Creator: csarron
Published: 2022-03-02T23:29:22
Keywords: Regionus

by csarronUpdated 4y ago

Description

25,000,000 image-caption pairs structured for large-scale multimodal model training. The collection expands upon the 4M Img Caps framework to provide a higher volume of text-image associations for vision-language tasks.

Use Cases

Train zero-shot image classifiers using the text and image alignment data
Develop automated image captioning systems by mapping visual inputs to the provided text strings
Benchmark text-to-image retrieval performance across 25 million potential candidates

Strengths

25,000,000 unique image-caption records
Compatible with the data loading scripts and schema used for the 4M Img Caps dataset
Optimized for large-scale pre-training of multimodal transformers and CLIP-style models

Regionus

Related Datasets

Quality Score

D25

Description

22

Source

36

Reputation

12

Access

22

Community

166 downloads

1 likes

0 views

Dataset Info

Author: csarron
Created: Mar 2, 2022
Updated: Mar 28, 2022
Last synced: Apr 29, 2026

Access

22

Community

166 downloads

1 likes

0 views

Dataset Info

Author: csarron
Created: Mar 2, 2022
Updated: Mar 28, 2022
Last synced: Apr 29, 2026

25M Img Caps: 25 Million Image-Caption Pairs

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info