Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A dataset designed for instruction tuning in multimodal settings involving visual interaction data. It was created by nyu-visionx and released in 2024 to address the scarcity of high-quality multimodal instruction-tuning data. The dataset aims to maintain the language abilities of multimodal large language models.
License is unknown; users must verify terms before use.