Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Training data generated from past ArXiv articles using the BrokenArXiv pipeline. The dataset is intended for training models on research-level mathematical problems and is licensed under cc-by-4.0, though individual rows may have different licenses. The dataset was created by MathArena and was last updated on 2026-06-16.
Each individual row may have a different license depending on the source article, which must be respected by downstream users.