A subset of Google DeepMind's RoboVQA dataset, re-hosted for loader compatibility. It contains human-annotated, long-horizon robotics video question-answering data across three embodiments and was used to train the allenai/Molmo2-ER-4B model. The upstream dataset is described in the paper 'RoboVQA: Multimodal Long-Horizon Reasoning for Robotics' (arXiv:2311.00899).
License is listed as cc-by-4.0 (inherited from upstream).