Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
NJU-LINK's OmniVideoBench is a large-scale benchmark dataset designed to evaluate multimodal large language models on joint audio and visual reasoning tasks. It addresses a gap in existing benchmarks that often focus on a single modality. The dataset was last updated on April 8, 2026.
The full description is hosted externally; users must visit the Hugging Face dataset page for complete details.