DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

World Model Cognitive Benchmark for Embodied AI | DataSalon

Home Reinforcement LearningWorld Model Cognitive Benchmark for Embodied AI

Reinforcement Learning

World Model Cognitive Benchmark for Embodied AI

Name: World Model Cognitive Benchmark for Embodied AI
Creator: FINAL-Bench
Published: 2026-03-29T15:03:30
Keywords: World Model Benchmark, Wm Bench, Librarypolars, Languageen, Prometheus, World Model, Size Categoriesn1 K, Modalitytext, Librarymlcroissant, Librarydatasets, Benchmark, Librarypandas, Computer Vision, Artificial Intelligence, Task Categoriesother, Regionus, Cognitive Evaluation, Vidraft, Agi, JSON, Licenseapache 20, Multimodal

by FINAL-Bench·Updated 3mo ago

Available on 1 platform

Description

WM Bench v1.0 is the first benchmark designed to evaluate the cognitive capabilities of World Models and Embodied AI systems. It was created by FINAL-Bench and released in March 2026. The benchmark moves beyond measuring visual fidelity to assess a model's reasoning and understanding.

Use Cases

Benchmarking a world model's ability to predict future states based on its internal representations.
Evaluating an embodied AI system's performance on tasks requiring planning and reasoning, as defined by the benchmark's metrics.
Comparing different model architectures on their cognitive capabilities using the provided evaluation protocol.

Strengths

First benchmark specifically for world model and embodied AI cognitive evaluation.
Released in March 2026, indicating recent development.

Limitations

Specific dataset size, row count, and column details are unknown.
The full description and task specifics require visiting an external page.

Provenance

Source: FINAL-Bench
Freshness: Last updated 2026-03-29.

The complete dataset description and detailed task definitions are hosted externally on Hugging Face. License information is not provided in the available input.

Multimodal JSON World Model Benchmark Wm Bench Librarypolars Languageen Prometheus World Model Size Categoriesn1 K Modalitytext Librarymlcroissant Librarydatasets Benchmark Librarypandas Computer Vision Artificial Intelligence Task Categoriesother Regionus Cognitive Evaluation Vidraft Agi Licenseapache 20

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

963 downloads

23 likes

0 views

Dataset Info

Author: FINAL-Bench
Created: Mar 29, 2026
Updated: Mar 29, 2026
Last synced: Jun 26, 2026

Access

Community

963 downloads

23 likes

0 views

Dataset Info

Author: FINAL-Bench
Created: Mar 29, 2026
Updated: Mar 29, 2026
Last synced: Jun 26, 2026

World Model Cognitive Benchmark for Embodied AI

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info