Viet Sharegpt 4O Text Vqa

Name: Viet Sharegpt 4O Text Vqa
Creator: 5CD-AI
Published: 2024-09-16T07:20:54
Keywords: Size Categories10 Kn100 K, Librarypolars, Librarydask, Modalitytext, Librarymlcroissant, Modalityimage, Librarydatasets, Parquet, Regionus, Arxiv240812480

by 5CD-AIUpdated 1y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

42,678 Vietnamese images paired with detailed text descriptions and visual question-answering pairs generated by GPT-4o. The dataset includes spatial metadata for objects and text, covering specific attributes such as font style, color, and size within a Vietnamese linguistic context.

Use Cases

Train Vietnamese OCR models that require font and color recognition using the text description fields
Develop multimodal LLMs capable of spatial reasoning by leveraging the object location and quantity data
Fine-tune visual question answering (VQA) systems for Vietnamese using the detailed long-form answer pairs
Build image captioning models that describe complex scenes including object composition and text attributes

Strengths

42,678 Vietnamese images with corresponding GPT-4o generated annotations
Includes text-specific metadata such as font style, color, position, and size for all recognized text
Features object-level details including location coordinates and quantity counts within the image descriptions
Provides long-form, detailed answers for visual question-answering tasks in the Vietnamese language

Parquet Size Categories10 Kn100 K Librarypolars Librarydask Modalitytext Librarymlcroissant Modalityimage Librarydatasets Regionus Arxiv240812480

Related Datasets

Quality Score

D36

Description

39

Source

36

Reputation

38

Access

22

Community

84 downloads

57 likes

0 views

Dataset Info

Author: 5CD-AI
Created: Sep 16, 2024
Updated: Oct 1, 2024
Last synced: Jul 23, 2026

Access

22

Community

84 downloads

57 likes

0 views

Dataset Info

Author: 5CD-AI
Created: Sep 16, 2024
Updated: Oct 1, 2024
Last synced: Jul 23, 2026

Viet Sharegpt 4O Text Vqa

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info