Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MulSeT is a benchmark dataset designed to challenge multimodal large language models (MLLMs) on spatial reasoning tasks. It requires models to integrate information from two distinct viewpoints of a 3D scene to answer questions. The dataset was created by WanyueZhang and was last updated on the Hugging Face platform in November 2025.
License is unknown; terms must be verified before use.