Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Between 100,000 and 1,000,000 multimodal conversational records comprise this dataset released by trl-lib in 2025. It facilitates instruction tuning by pairing images with multi-turn dialogue prompts and target completions. The data is structured specifically for language modeling and visual-text alignment tasks.
This dataset is a processed version specifically formatted for use with the TRL (Transformer Reinforcement Learning) library. Users should ensure their environment supports Parquet and multimodal data loading.