Microsoft's VITRA dataset contains real-world robot teleoperation demonstrations collected using a 7-DoF robotic arm with a dexterous hand and a head-mounted RGB camera. Each episode provides synchronized numerical state and action data alongside video recordings, intended for finetuning vision-language-action models for robotic manipulation.
The full description is available on the Hugging Face dataset page. This summary does not include schema details such as column names or sample data.