Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
SlideVQA is a document visual question answering dataset containing between 10,000 and 100,000 records, released by NTT-hil-insight in 2023. It focuses on multi-image reasoning where models must select specific evidence slides from a deck to answer natural language questions.
Users should consult the Arxiv paper 2301.04883 for specific evaluation metrics regarding evidence selection accuracy.