Available on 1 platform
OlympicArena is a benchmark dataset from GAIR, last updated in July 2024, designed to evaluate AI capabilities across seven Olympic-level disciplines: Mathematics, Physics, Chemistry, Biology, Geography, Astronomy, and Computer Science. It features a detailed, fine-grained evaluation mechanism for assessing advanced cognitive reasoning.
Its primary purpose is benchmarking AI models, so it may not be structured as a typical tabular dataset suited to direct analysis. Review the full description on the Hugging Face page for detailed evaluation protocols and access instructions.