ViQuAE: Visual Question Answering over Entities

Name: ViQuAE: Visual Question Answering over Entities
Creator: PaulLerner
Published: 2022-03-02T23:29:22
Keywords: Regionus

by PaulLernerUpdated 4y ago

Description

3,700 question-answer pairs linked to images and a knowledge base of 1.5 million Wikipedia entities. The dataset facilitates visual entity retrieval where answers are specific entities rather than generic object labels.

Use Cases

Train entity-linking models that map visual regions to specific Wikipedia entries
Benchmark retrieval-augmented generation (RAG) systems using the provided knowledge base and image queries
Develop multimodal reasoning models that combine visual features with structured external knowledge

Strengths

3,700 human-annotated questions requiring external knowledge
Knowledge base containing 1.5 million entities with associated text and images
Answers are mapped to unique Wikipedia entity identifiers

Regionus

Related Datasets

Quality Score

D25

Description

22

Source

36

Reputation

9

Access

22

Community

23 downloads

0 views

Dataset Info

Author: PaulLerner
Created: Mar 2, 2022
Updated: Feb 15, 2022
Last synced: Apr 29, 2026

Access

22

Community

23 downloads

0 views

Dataset Info

Author: PaulLerner
Created: Mar 2, 2022
Updated: Feb 15, 2022
Last synced: Apr 29, 2026

ViQuAE: Visual Question Answering over Entities

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info