Available on 1 platform
COSMIC is a benchmark dataset created by mair-lab to test whether multimodal language models can transform local, viewpoint-dependent observations into shared spatial models through language. It places two static agents in the same indoor scene from different egocentric viewpoints, requiring them to communicate exclusively through natural language to jointly solve a spatial question-answering task. The dataset was last updated on 2026-03-31.
License restrictions are unknown and must be verified before use.