Available on 1 platform
MCSBench v1.0 is a diagnostic benchmark for evaluating multimodal large language models. It contains base visual question answering records, reasoning-chain selection records, evidence fields, and image references. The dataset was created by mcsbench and last updated on May 7, 2026.
The license is unknown; verify the terms of use before using the dataset.