Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
96 challenging questions based on images from OpenImages form this evaluation benchmark for hallucination in Large Multimodal Models. It includes ground-truth answers and image contents. The dataset was created by Shengcao1006 and uploaded in November 2023.
License is listed as Apache 2.0 on the platform, but confirmation from the original description is advised. The dataset is designed solely for evaluation, not for training large-scale models.