Visual Haystacks (VHs) is a benchmark dataset designed to evaluate Large Multimodal Models' capability to handle long-context visual information. It is described as the first vision-centric Needle-In-A-Haystack benchmark. The dataset was created by tsunghanwu and was last updated on Hugging Face on October 16, 2024.
The description notes that users should also download the COCO-2017 training and validation sets, indicating a dependency on external data.
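Because the images come from a separate download, it can help to confirm that the COCO-2017 data is present before using the benchmark. The following is a minimal sketch, not part of the VHs tooling: the helper `missing_coco_splits` is hypothetical, and the folder names `train2017` and `val2017` are an assumption based on what the standard COCO-2017 image archives unpack to. The directory layout that VHs itself expects is not stated in the description.

```python
from pathlib import Path

# Assumed folder names: what the standard COCO-2017 image archives unpack to.
# The layout VHs actually expects may differ; adjust as needed.
COCO_2017_SPLITS = ("train2017", "val2017")


def missing_coco_splits(coco_root):
    """Return the names of COCO-2017 split folders not found under coco_root."""
    root = Path(coco_root)
    return [name for name in COCO_2017_SPLITS if not (root / name).is_dir()]
```

Calling `missing_coco_splits` on a local COCO directory returns an empty list when both split folders exist, and otherwise lists the ones that still need to be downloaded.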