LLaVA Instruct Mix: 100K-1M Multimodal Conversational Records

Name: LLaVA Instruct Mix: 100K-1M Multimodal Conversational Records
Creator: trl-lib
Published: 2025-08-09T04:27:42
Keywords: Librarypolars, Librarydask, Modalitytext, Size Categories100 Kn1 M, Librarymlcroissant, Trl, Modalityimage, Librarydatasets, Parquet, Regionus

by trl-libUpdated 11mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

Between 100,000 and 1,000,000 multimodal conversational records comprise this dataset released by trl-lib in 2025. It facilitates instruction tuning by pairing images with multi-turn dialogue prompts and target completions. The data is structured specifically for language modeling and visual-text alignment tasks.

Use Cases

Training multimodal models to interpret the 'images' column in the context of a 'prompt'
Fine-tuning conversational agents using the 'completion' field as the target response
Benchmarking visual reasoning by analyzing the relationship between 'images' and the conversational history

Strengths

Scale of 100,000 to 1,000,000 records
Standardized conversational schema with 'prompt' and 'completion' fields
Parquet format for efficient data loading and processing

Limitations

Lack of detailed documentation regarding the specific data sources included in the mix
Potential for geographic bias as metadata indicates a US region focus
Unknown license status may restrict commercial use

Provenance

Source: trl-lib
Collection Method: Processed version of LLaVA Instruct Mix
Freshness: Last updated August 16, 2025.
Geography: United States

This dataset is a processed version specifically formatted for use with the TRL (Transformer Reinforcement Learning) library. Users should ensure their environment supports Parquet and multimodal data loading.

Parquet Librarypolars Librarydask Modalitytext Size Categories100 Kn1 M Librarymlcroissant Trl Modalityimage Librarydatasets Regionus

Related Datasets

Quality Score

D38

Description

39

Source

36

Reputation

48

Access

22

Community

1.3K downloads

3 likes

0 views

Dataset Info

Author: trl-lib
Created: Aug 9, 2025
Updated: Aug 16, 2025
Last synced: May 30, 2026

Access

22

Community

1.3K downloads

3 likes

0 views

Dataset Info

Author: trl-lib
Created: Aug 9, 2025
Updated: Aug 16, 2025
Last synced: May 30, 2026

LLaVA Instruct Mix: 100K-1M Multimodal Conversational Records

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info