Available on 1 platform
1-hour videos and v1.0 development-set annotations for long-form video-language understanding. This benchmark from Stanford University was introduced at NeurIPS 2024 to evaluate how well models understand video over extended temporal sequences.