LLaVAR provides about 422,000 pretraining data pairs and between 16,000 and 20,000 instruction-following data pairs for training multimodal AI models. Created by SALT-NLP, the dataset is intended to improve visual instruction tuning on images that contain text. It was released and last updated in July 2023.
The full dataset description and access details are available on the Hugging Face dataset page. License information is not specified in this listing.