SpatialVID is a dataset of between 1 million and 10 million video records paired with spatial annotations, developed by researchers at Nanjing University and the Chinese Academy of Sciences for CVPR 2026. It supports multi-modal generative tasks by linking video sequences with 3D spatial metadata and English text descriptions.
Released under the CC BY-NC-SA 4.0 license. Use requires citing the CVPR 2026 paper (arXiv:2509.09676).