Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
SPAR-Bench contains 7,207 manually verified spatial reasoning question-answer pairs across 20 distinct tasks, released by jasonzhango in 2025. The benchmark evaluates vision-language models using single-view, multi-view, and video modalities to test spatial perception and reasoning capabilities.
Associated with Arxiv paper 2503.22976; data is provided in Parquet format and requires standard VLM evaluation pipelines.