Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
5,227 mathematical Olympiad problems and rubrics for reinforcement learning training, released by lm-provers in 2026. The collection includes problems from international competitions and Art of Problem Solving (AoPS) annotated with Gemini-3-Pro rubrics and Qwen-based rewards.
Associated with Arxiv paper 2511.01846; rubrics and scores are model-generated.