Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Training data for the 4DThinker framework, which enables Vision Language Models to 'think with 4D' through dynamic latent mental imagery. The dataset includes approximately 38,000 samples for DIFT training and 37,000 samples for 4DRL training, built upon SpatialVID and DSR_Suite-Data. It was authored by jankin123 and last updated on May 11, 2026.
License is unknown, which may restrict usage.