Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
VLM-SubtleBench provides between 10,000 and 100,000 image pairs to evaluate the subtle comparative reasoning capabilities of Vision-Language Models. Developed by KRAFTON and released in early 2026, the dataset targets domains where visual differences are nuanced, such as medical imaging and industrial anomaly detection.
Released under CC BY-NC 4.0, which prohibits commercial use; requires tools capable of handling multi-image input for VLMs.