Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
20,000 samples combine questions and images from three established VQA datasets: AOKVQA, Path-VQA, and TDIUC. This medium-sized benchmark is designed to test the multi-domain knowledge of vision-language models. It was created by dutta18 for educational and research purposes, with copyright retained by the original dataset owners.
Intended for educational and research purposes only; all copyright belongs to the original dataset owners.