IMO-AnswerBench is a benchmark dataset for evaluating mathematical reasoning in large language models. It consists of 400 challenging short-answer problems drawn from the International Mathematical Olympiad and other sources. Google DeepMind released the dataset in conjunction with its 2025 IMO gold-medal performance, and it is hosted by OpenEvals.
The license is unknown; verify the terms of use before using the dataset.