Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
WM Bench v1.0 is the first benchmark designed to evaluate the cognitive capabilities of World Models and Embodied AI systems. It was created by FINAL-Bench and released in March 2026. The benchmark moves beyond measuring visual fidelity to assess a model's reasoning and understanding.
The complete dataset description and detailed task definitions are hosted externally on Hugging Face. License information is not provided in the available input.