Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A training and evaluation corpus for VDocRAG, a retrieval-augmented generation framework designed to understand real-world documents from visual features. The dataset is a unified collection of open-domain document visual question answering data, encompassing diverse document types and formats. It was created by NTT-hil-insight and last updated on 2025-05-26.
License is unknown; users must verify terms of use before applying the dataset.