Available on 1 platform
MindCube is a benchmark, created by MLL-Lab, for evaluating the ability of vision-language models (VLMs) to form spatial mental models from limited visual information. It contains 21,154 questions across 3,268 images. The dataset was last updated in November 2025.
The dataset's license is unknown. The full dataset description is hosted externally on Hugging Face.