Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
ViTextVQA contains over 16,000 images and 50,000 question-answer pairs focused on Vietnamese text comprehension within visual contexts. Developed by researcher minhquan6203 and documented in Arxiv paper 2404.10652, it serves as a benchmark for text-based visual question answering in the Vietnamese language.
The dataset is released under a Creative Commons Attribution-NonCommercial 3.0 license. Users should refer to the associated Arxiv paper 2404.10652 for detailed methodology and evaluation metrics.