Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
VisRes Bench contains 10,000 to 100,000 image-text pairs designed to evaluate the visual reasoning of Vision-Language Models (VLMs) in naturalistic settings. Developed by researchers at TII (tiiuae) and updated in March 2026, it isolates visual logic by removing contextual language supervision.
Data is provided in Parquet format and is compatible with Polars, Dask, and the Hugging Face datasets library; refer to Arxiv paper 2512.21194 for methodology.