SCI-CQA: 5,629 Chart Understanding Questions from Scientific Literature

Name: SCI-CQA: 5,629 Chart Understanding Questions from Scientific Literature
Creator: lyndons1
Published: 2024-09-05T10:48:08
Keywords: Chart Understanding, Benchmark, Question Answering, Scientific Literature, Multimodal Benchmark, Multimodal

by lyndons1Updated 1y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

SCI-CQA is a multimodal benchmark dataset for evaluating chart understanding, inspired by human exams. It contains 5,629 curated objective and open-ended questions paired with 2,894 chart images from scientific literature. The dataset was created by lyndons1 and last updated on April 28, 2025.

Use Cases

Benchmarking chart question-answering models based on the 5,629 curated questions.
Training multimodal vision-language models based on the 2,894 chart images and their associated questions.
Evaluating model performance on scientific literature comprehension based on the dataset's source material.
Developing models for open-ended reasoning about charts based on the included open-ended question format.

Strengths

5,629 carefully curated questions provide a substantial evaluation set.
2,894 chart images offer a visual corpus for multimodal tasks.
Framework includes both objective and open-ended questions, enabling diverse evaluation.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count for the question-answer data is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface
Collection Method: Curated from scientific literature, as described in the associated paper.
Time Range: null
Freshness: Last updated 2025-04-28 06:40:25.
Geography: null

null

Multimodal Chart Understanding Benchmark Question Answering Scientific Literature Multimodal Benchmark

Related Datasets

Quality Score

C41

Description

51

Source

39

Reputation

40

Access

22

Community

6 downloads

3 likes

0 views

Dataset Info

Author: lyndons1
Created: Sep 5, 2024
Updated: Apr 28, 2025
Last synced: Apr 23, 2026

Access

22

Community

6 downloads

3 likes

0 views

Dataset Info

Author: lyndons1
Created: Sep 5, 2024
Updated: Apr 28, 2025
Last synced: Apr 23, 2026

SCI-CQA: 5,629 Chart Understanding Questions from Scientific Literature

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info