Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
The TextVQA dataset contains 45,336 questions based on 28,408 images from the OpenImages collection. It requires models to read and reason about text present within images to answer the provided questions.
License is indicated as CC BY 4.0; users should verify the specific terms. The dataset is monolingual (English).