Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
RoboFine-Bench is a benchmark for evaluating Vision-Language Models on execution-level details of robot manipulation. It contains 500 held-out robot videos and is part of the FineVLA framework for fine-grained instruction alignment. The dataset was authored by xlangai and last updated on HuggingFace in May 2026.
License is unknown; terms of use must be verified before application.