LLaVA CoT 25K: Multimodal Instruction-Following Dataset

Name: LLaVA CoT 25K: Multimodal Instruction-Following Dataset
Creator: tomkld
Published: 2024-12-10T10:20:42
Keywords: Size Categories10 Kn100 K, Image Text Pairs, Librarypolars, Modalitytext, Librarymlcroissant, Vision Language, Multimodal Ai, Modalityimage, Librarydatasets, Librarypandas, Regionus, JSON, Multimodal

by tomkldUpdated 1y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

25,000 multimodal examples likely containing images paired with text instructions and chain-of-thought reasoning. The dataset was created by author 'tomkld' and last updated on Hugging Face on December 10, 2024. Its columns suggest it contains image and text data for training vision-language models.

Use Cases

Fine-tune a vision-language model on instruction-following tasks (inferred from domain, verify after download)
Benchmark model performance on visual reasoning with chain-of-thought (inferred from domain, verify after download)
Generate synthetic multimodal training data for alignment (inferred from domain, verify after download)

Strengths

Published on Hugging Face, a major platform for ML datasets.
Last updated on 2024-12-10, indicating recent maintenance.
Platform tags indicate it contains both image and text modalities.

Limitations

Metadata is minimal; actual content requires verification after download.
Row count, column definitions, and license are unknown.
Data may reflect bias inherent to its unspecified collection source and method.

Provenance

Source: tomkld (author on Hugging Face)
Collection Method: Collection method is unknown.
Time Range: Temporal coverage is unknown.
Freshness: Last updated 2024-12-10 10:53:19
Geography: Spatial coverage is unknown.

License is unknown, which may restrict commercial or research use.

Multimodal JSON Size Categories10 Kn100 K Image Text Pairs Librarypolars Modalitytext Librarymlcroissant Vision Language Multimodal Ai Modalityimage Librarydatasets Librarypandas Regionus

Related Datasets

Quality Score

D24

Description

10

Source

36

Reputation

27

Access

22

Community

10 downloads

2 likes

0 views

Dataset Info

Author: tomkld
Created: Dec 10, 2024
Updated: Dec 10, 2024

Access

22

Community

10 downloads

2 likes

0 views

Dataset Info

Author: tomkld
Created: Dec 10, 2024
Updated: Dec 10, 2024

LLaVA CoT 25K: Multimodal Instruction-Following Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info