Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A temporally grounded benchmark for evaluating LLM reasoning during an ongoing geopolitical conflict. The dataset covers the early stages of the 2026 Middle East conflict, which unfolded after the training cutoff of current frontier models. It is authored by AIcell for the paper 'When AI Navigates the Fog of War'.
The full description is on an external dataset page; key details like columns, rows, sample data, and file formats are unknown from the provided snippet. License information is also unknown.