Sign in to view source links and access this dataset
Description
56,989 images depicting quintessentially Vietnamese scenes were annotated using Visual Question Answering (VQA) techniques. The dataset includes landscapes, historical sites, culinary specialties, festivals, and everyday life from various regions. It was created by 5CD-AI and last updated on Hugging Face in August 2024.
Use Cases
Train Visual Question Answering models based on Vietnamese cultural imagery.
Benchmark multimodal AI performance on region-specific visual concepts.
Fine-tune image captioning models for Vietnamese scenes and objects.
Develop educational or cultural heritage applications using annotated visual data.
Strengths
Contains 56,989 images, providing a substantial base for model training.
Focuses on a specific cultural and geographic context (Vietnam).
Annotations were created using advanced VQA techniques, suggesting structured question-answer pairs.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is known, but specific sample data, file formats, and license are unknown.
Data may reflect geographic and cultural bias inherent to its specific Vietnamese focus.
Provenance
Source
5CD-AI via Hugging Face.
Collection Method
Images were analyzed and annotated using advanced Visual Question Answering (VQA) techniques.
Time Range
null
Freshness
Last updated 2024-08-25 16:28:57.
Geography
Vietnam, covering various regions.
License is unknown; terms of use must be verified before application.