Megalith 10M Florence2

Name: Megalith 10M Florence2
Creator: aipicasso
Published: 2024-07-29T02:16:42
Keywords: Librarypolars, Task Categoriesimage To Text, Size Categories1 Mn10 M, Languageen, Task Categoriestext To Image, Modalitytext, CSV, Librarymlcroissant, Librarydatasets, Librarypandas, Regionus, Licensemit

by aipicassoUpdated 2y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

10,000,000 image-caption pairs generated using the Florence-2 vision-language model for the Megalith-10M image collection. Textual descriptions supplement the previously uncaptioned CC-0 like images to support vision-language model training.

Use Cases

Train text-to-image diffusion models using the Florence-2 captions as training prompts
Fine-tune vision-language models for image captioning or visual question answering using the paired image and text data
Build image search engines by mapping the Florence-2 text descriptions to the original Megalith-10M images

Strengths

10,000,000 image-caption pairs
Captions generated using the Florence-2 vision-language model
Derived from the CC-0 like Megalith-10M image repository

CSV Librarypolars Task Categoriesimage To Text Size Categories1 Mn10 M Languageen Task Categoriestext To Image Modalitytext Librarymlcroissant Librarydatasets Librarypandas Regionus Licensemit

Related Datasets

Quality Score

D35

Description

39

Source

36

Reputation

37

Access

22

Community

108 downloads

25 likes

0 views

Dataset Info

Author: aipicasso
Created: Jul 29, 2024
Updated: Jul 31, 2024
Last synced: Jun 29, 2026

Access

22

Community

108 downloads

25 likes

0 views

Dataset Info

Author: aipicasso
Created: Jul 29, 2024
Updated: Jul 31, 2024
Last synced: Jun 29, 2026

Megalith 10M Florence2

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info