ChartGalaxy is a dataset for infographic chart understanding and generation. A December 2025 update added 19,458 charts, and a February 2026 update added a new batch of 108,208 charts featuring broader diversity in title designs and more polished layouts. The dataset was last updated on 2026-04-19.
Use Cases
- Train chart type classification models on the infographic chart collection.
- Develop visual question answering systems for charts using the dataset's infographic content.
- Generate synthetic or enhanced infographic charts using the dataset's diverse title designs and layouts.
- Benchmark layout analysis and readability assessment algorithms on the polished chart layouts.
Strengths
- Contains at least 127,666 infographic charts, based on the combined totals from the 2025 and 2026 updates.
- The February 2026 update specifically features broader diversity in title designs and more polished layouts, improving readability.
- The dataset is actively maintained, with two documented updates in December 2025 and February 2026.
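The combined figure in the first point can be checked with one line of arithmetic. The sketch below sums the two update totals quoted in the description. The result is only a lower bound: it assumes the two batches do not overlap, and it does not count any charts released before December 2025.

```python
# Update totals quoted in the dataset description.
december_2025_batch = 19_458
february_2026_batch = 108_208

# Lower bound on the collection size, assuming the two batches are disjoint.
minimum_total = december_2025_batch + february_2026_batch
print(f"{minimum_total:,}")  # 127,666
```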
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The total row count is not documented beyond the two update totals, which makes it harder to assess suitability for specific tasks.
- Description metadata is limited; actual data quality requires manual inspection after download.
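Because no column-level documentation is provided, a first pass after download might simply tally which fields appear and what types they hold. The following is a minimal sketch of that idea, assuming the records have already been loaded as a list of Python dictionaries. The function name `summarize_fields` and the sample field names (`chart_type`, `title`, `width`) are hypothetical and are not taken from ChartGalaxy.

```python
from collections import Counter, defaultdict


def summarize_fields(records):
    """Tally, per field, how often it appears, how often it is None, and its value types."""
    summary = defaultdict(lambda: {"count": 0, "missing": 0, "types": Counter()})
    for record in records:
        for field, value in record.items():
            entry = summary[field]
            entry["count"] += 1
            if value is None:
                entry["missing"] += 1
            else:
                entry["types"][type(value).__name__] += 1
    return dict(summary)


# Hypothetical sample records; real field names must be read from the download.
sample = [
    {"chart_type": "bar", "title": "Sales by region", "width": 800},
    {"chart_type": "pie", "title": None, "width": 640},
    {"chart_type": "line", "width": "1024"},
]

for field, info in summarize_fields(sample).items():
    print(field, info["count"], info["missing"], dict(info["types"]))
```

A field that appears in fewer records than the total, or that holds more than one type (as `width` does in the sample), is a candidate for manual inspection before training or benchmarking.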
Provenance
- Source: ChartGalaxy
- Collection Method: Likely collected or synthesized for research in chart understanding and generation.
- Time Range: Updates occurred in December 2025 and February 2026.
- Freshness: Last updated 2026-04-19 04:58:04; freshness should be verified.
- Geography: Not specified
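One way to act on the freshness caveat is to compute how long ago the recorded timestamp was. The sketch below parses the last-updated timestamp listed above and reports its age in whole days. It assumes the timestamp is in UTC, since the source does not state a time zone, and the helper name `days_since_update` is hypothetical.

```python
from datetime import datetime, timezone

# Timestamp from the Freshness entry; UTC is an assumption, not stated in the source.
LAST_UPDATED = "2026-04-19 04:58:04"


def days_since_update(timestamp, now=None):
    """Return the number of whole days between `timestamp` and `now` (default: current time)."""
    updated = datetime.strptime(timestamp, "%Y-%m-%d %H:%M:%S").replace(tzinfo=timezone.utc)
    if now is None:
        now = datetime.now(timezone.utc)
    return (now - updated).days


# Fixed reference date so the example is reproducible.
reference = datetime(2026, 5, 19, 4, 58, 4, tzinfo=timezone.utc)
print(days_since_update(LAST_UPDATED, now=reference))  # 30
```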