Available on 1 platform
This dataset was created by remyxai and last updated on April 23, 2025. It is designed for training LLaVA-style vision-language models and contains synthesized spatial reasoning traces. The data was generated with VQASynth from a subset of the images in the localized narratives split of The Cauldron.
The license is unknown; verify the terms of use before using the dataset.