Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Tokyo driving data provides a large-scale visual question answering dataset for physically grounded spatiotemporal reasoning. It contains 16 million question-answer pairs over 270,000 frames, constructed from 100 hours of multi-sensor driving data. The dataset was created by turing-motors and last updated on the platform in January 2026.
The dataset page notes this is a preliminary mini version and does not fully match the format of the full dataset.