Available on 1 platform
This training dataset was generated from past arXiv articles and includes outputs from the Qwen3.6-35B model. It contains the model's answers on whether perturbed mathematical statements are correct; the expected answer is always "disprove". It was created by MathArena and last updated on June 16, 2026.
The license is unknown; users must verify the terms before use.