Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
137,000 images containing Vietnamese text paired with 822,679 synthetic visual question-answering pairs generated by Gemini 1.5 Flash. Created by 5CD-AI and updated in February 2026, this collection focuses on Vietnamese OCR and scene understanding.
Dataset is provided in Parquet format and is linked to Arxiv paper 2408.12480.