Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
The VQA dataset contains open-ended questions about images, requiring an understanding of vision, language, and commonsense knowledge to answer. It was created by HuggingFaceM4 and last updated in June 2022.
The specific version (VQAv2) is implied by the title, but details on splits (train/val/test), annotation format, and license are not provided in the input.