Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
HorizonMath is a benchmark for measuring AI progress in mathematical discovery through automated verification, as described in a 2026 arXiv paper by Erik Y. Wang and colleagues. The dataset was created by 'squashenthus' and last updated on Hugging Face in March 2026. It focuses on evaluating AI systems' ability to generate and verify mathematical statements.
The full description and data details are only available on the Hugging Face dataset page. License information is unknown and must be checked before use.