DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

A-OKVQA: Augmented Outside-Knowledge Visual Question Answering | DataSalon

Home Multimodal & LLMA-OKVQA: Augmented Outside-Knowledge Visual Question Answering

Multimodal & LLM

A-OKVQA: Augmented Outside-Knowledge Visual Question Answering

Name: A-OKVQA: Augmented Outside-Knowledge Visual Question Answering
Creator: allenai
Published: 2022-05-10T19:32:42
License: Apache-2.0
Keywords: Computer Vision, Natural Language Processing, Visual Question Answering

by allenai / allenai·Updated 2y ago

Available on 1 platform

Description

24,903 visual question-answering pairs paired with images from the COCO dataset, categorized into multiple-choice and direct-answer formats. Each entry includes human-annotated rationales explaining the reasoning required to answer questions that necessitate external knowledge beyond the visual content.

Use Cases

Develop explainable AI models by training on the 'rationales' field to justify visual reasoning steps
Benchmark visual question answering performance using the 'multiple_choice' and 'direct_answer' ground truth labels
Train multi-modal transformers to integrate external knowledge by processing the 'question' and 'image' inputs alongside knowledge retrieval systems

Strengths

24,903 unique questions split into training, validation, and test sets
Includes 'rationales' column providing natural language explanations for the correct answers
Features two distinct evaluation formats: 'multiple_choice' with four options and 'direct_answer' for open-ended response
Questions are mapped to 'image_id' from the COCO 2017 dataset

Computer Vision Natural Language Processing Visual Question Answering

Related Datasets

Quality Score

D22

Description

Source

Reputation

Quality Score

D22

Description

Source

Reputation

Access

Community

112 likes

0 views

Dataset Info

License: Apache-2.0
Author: allenai
Org: allenai
Created: May 10, 2022
Updated: May 8, 2024
Language: Python
Last synced: May 19, 2026

Access

Community

112 likes

0 views

Dataset Info

License: Apache-2.0
Author: allenai
Org: allenai
Created: May 10, 2022
Updated: May 8, 2024
Language: Python
Last synced: May 19, 2026

A-OKVQA: Augmented Outside-Knowledge Visual Question Answering

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info