LVBench: Long Video Understanding Benchmark with Two-Hour Durations

Name: LVBench: Long Video Understanding Benchmark with Two-Hour Durations
Creator: zai-org
Published: 2024-06-11T13:57:35
Keywords: Task Categoriesmultiple Choice, Languageen, Task Categoriesvisual Question Answering, Licensecc By Nc Sa 40, Size Categoriesn1 K, Arxiv240608035, Librarymlcroissant, Modalityimage, Librarydatasets, Modalityvideo, Regionus, Video, IMAGEFOLDER

by zai-orgUpdated 2y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

LVBench is a benchmark for long video understanding featuring videos up to two hours in duration, released by zai-org in June 2024. It contains approximately 1,000 records designed to evaluate multimodal models on visual question answering and multiple-choice tasks. The dataset addresses the challenge of extracting information from extended temporal windows that exceed standard video benchmarks.

Use Cases

Evaluating multimodal model performance on long-form video comprehension using multiple-choice questions
Testing information extraction capabilities across two-hour video durations
Benchmarking visual question answering (VQA) systems on temporal reasoning

Strengths

Includes videos with durations up to two hours
CC BY-NC-SA 4.0 licensed for research use
Features approximately 1,000 annotated records for benchmarking

Limitations

Small sample size of approximately 1,000 records
Non-commercial license restriction (CC BY-NC-SA 4.0)
High computational requirements for processing two-hour video files

Provenance

Source: zai-org (ArXiv: 2406.08035)
Freshness: Released June 2024; last updated June 13, 2024.

Users should be prepared for significant storage and compute requirements due to the two-hour video durations; the dataset is governed by a Creative Commons Attribution Non-Commercial Share Alike 4.0 license.

Video IMAGEFOLDER Task Categoriesmultiple Choice Languageen Task Categoriesvisual Question Answering Licensecc By Nc Sa 40 Size Categoriesn1 K Arxiv240608035 Librarymlcroissant Modalityimage Librarydatasets Modalityvideo Regionus

Related Datasets

Quality Score

D35

Description

36

Source

36

Reputation

39

Access

22

Community

1.2K downloads

12 likes

0 views

Dataset Info

Author: zai-org
Created: Jun 11, 2024
Updated: Jun 13, 2024
Last synced: Jun 17, 2026

Access

22

Community

1.2K downloads

12 likes

0 views

Dataset Info

Author: zai-org
Created: Jun 11, 2024
Updated: Jun 13, 2024
Last synced: Jun 17, 2026

LVBench: Long Video Understanding Benchmark with Two-Hour Durations

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info