Mantis-Instruct contains 721,000 instruction tuning examples across 14 specialized subsets. It is a fully interleaved text-image dataset designed for training multimodal models on skills like co-reference, reasoning, and temporal understanding. The dataset was created by TIGER-Lab for training the Mantis model families.
The full description is hosted on an external Hugging Face page; visit that page for complete details on the dataset's structure, format, and license.