This dataset contains the evaluation benchmarks for the Video-R1 model across video reasoning categories, including test sets for temporal and causal logic. It provides the data required to replicate the reasoning performance results presented in the research paper 'Video-R1: Reinforcing Video Reasoning in MLLMs', and it is designed to test the logical and temporal inference capabilities of Multimodal Large Language Models (MLLMs).