Three categories of multimodal geo-spatial data—tabular grids, heatmaps, and geographic visualizations—designed for foundation model evaluation. The benchmark tests the ability to process dense numerical values and interpret spatial-temporal dependencies within these grid structures.
Use Cases
- Benchmark foundation models on their ability to extract insights from dense numerical tabular grids.
- Compare model accuracy when interpreting the same geographic data presented as heatmaps versus raw numbers.
- Assess spatial-temporal reasoning capabilities using the provided geographic visualizations and time-sequenced grids.
Strengths
- Includes multimodal representations of geo-spatial data in tabular, heatmap, and geographic visualization formats.
- Focuses on dense numerical values and grid structures to test foundation model precision.
- Evaluates spatial and temporal dependencies as core metrics for geo-spatial reasoning.