Available on 1 platform
Published on Hugging Face by author zaiquan and last updated on 2025-12-04. The dataset likely contains multimodal data for spatio-temporal video grounding, a task that links a language query to a specific object and time segment in a video. Its exact content, scale, and collection methodology are not documented here and should be verified after download.
The license is unknown; users must verify the terms of use before using the dataset.