Available on 1 platform
Spatial457 contains between 10,000 and 100,000 image-text pairs designed for 6D spatial reasoning diagnostics. Created by researchers at Johns Hopkins University and DEVCOM Army Research Laboratory in 2025, it benchmarks the ability of multimodal models to interpret 3D orientations. The data is released under an Apache 2.0 license.
The dataset accompanies arXiv paper 2502.08636; refer to the project page for implementation details.