Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
AutoMathText is a dataset of approximately 200 GB of mathematical texts compiled from sources including various websites, arXiv, and GitHub repositories like OpenWebMath, RedPajama, and Algebraic Stack. The dataset was created by author 'math-ai' and its associated work was accepted to ACL 2025 Findings. The dataset was last updated on July 16, 2025.
License is unknown; users must verify terms before use.