Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
LVBench is a benchmark for long video understanding featuring videos up to two hours in duration, released by zai-org in June 2024. It contains approximately 1,000 records designed to evaluate multimodal models on visual question answering and multiple-choice tasks. The dataset addresses the challenge of extracting information from extended temporal windows that exceed standard video benchmarks.
Users should be prepared for significant storage and compute requirements due to the two-hour video durations; the dataset is governed by a Creative Commons Attribution Non-Commercial Share Alike 4.0 license.