Conceptual Captions 12

Name: Conceptual Captions 12
Creator: flax-community
Published: 2022-03-02T23:29:22
Keywords: Library: polars, Language: en, Size Categories: 10M<n<100M, Modality: text, CSV, Library: mlcroissant, Modality: image, Library: datasets, Library: pandas, Region: us

Description

Contains 12,000,000 English image-caption pairs derived from Google's Conceptual 12M dataset. The collection is stored in TSV format, with an image URL, a local filename, and a descriptive caption for each entry.
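
As a rough illustration of the TSV layout described above, the sketch below parses tab-separated rows into typed records using only the Python standard library. The column order (URL, filename, caption), the field names, the presence of a header row, and the `read_caption_rows` helper are all assumptions made for this example; the actual file may differ, so check it before relying on this.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class CaptionRow(NamedTuple):
    # Field names and order are assumptions, not taken from the dataset itself.
    image_url: str   # remote image link
    image_file: str  # local filename of the downloaded image
    caption: str     # English caption


def read_caption_rows(lines: Iterable[str], has_header: bool = True) -> Iterator[CaptionRow]:
    """Yield one CaptionRow per well-formed three-column TSV line."""
    # QUOTE_NONE keeps quote characters inside captions as literal text.
    reader = csv.reader(lines, delimiter="\t", quoting=csv.QUOTE_NONE)
    if has_header:
        next(reader, None)  # skip the (assumed) header row
    for fields in reader:
        if len(fields) != 3:
            continue  # skip malformed lines rather than guessing at their meaning
        yield CaptionRow(*(field.strip() for field in fields))


# Tiny in-memory sample standing in for the real file (hypothetical values).
sample = io.StringIO(
    "image_url\timage_file\tcaption\n"
    "https://example.com/a.jpg\ta.jpg\tA dog on a beach\n"
    "broken line without tabs\n"
    "https://example.com/b.jpg\tb.jpg\tTwo cups of coffee\n"
)
rows = list(read_caption_rows(sample))
```

For the full file, the same function can be given an open file handle, for example `open(path, newline="", encoding="utf-8")`, so that rows are streamed instead of loaded into memory at once.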

Use Cases

Train multimodal embedding models using the image link and caption columns.
Fine-tune text-to-image synthesis models by pairing the caption text with downloaded images.
Benchmark image retrieval systems using the provided English captions as search queries.
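
One common way to score the retrieval use case is recall@k: for each caption used as a query, check whether its paired image appears among the k highest-scoring images. The sketch below assumes a precomputed caption-by-image similarity matrix in which row i and column i belong to the same pair; the `recall_at_k` helper and the toy scores are hypothetical, and the page does not prescribe any particular metric.

```python
from typing import Sequence


def recall_at_k(scores: Sequence[Sequence[float]], k: int) -> float:
    """Fraction of captions whose paired image (same index) ranks in the top k."""
    hits = 0
    for i, row in enumerate(scores):
        # Rank image indices by descending similarity to caption i.
        ranked = sorted(range(len(row)), key=lambda j: row[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(scores)


# Toy similarity matrix: rows are caption queries, columns are candidate images.
toy_scores = [
    [0.9, 0.1, 0.3],
    [0.2, 0.4, 0.8],
    [0.1, 0.2, 0.7],
]
r1 = recall_at_k(toy_scores, 1)
r2 = recall_at_k(toy_scores, 2)
```

In practice the similarity scores would come from whatever embedding model is being benchmarked; the metric itself does not depend on how they are produced.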

Strengths

12,000,000 rows of image-text associations for large-scale model training.
TSV file structure containing image links, downloaded file names, and captions.
Cleaned data specifically optimized for TPU-VM environments.

Quality Score

D28

Description: 24
Source: 36
Reputation: 24
Access: 22

Community

693 downloads

4 likes

0 views

Dataset Info

Author: flax-community
Created: Mar 2, 2022
Updated: Jan 13, 2024
Last synced: Jul 26, 2026
