This dataset contains 7.5 million mathematical reasoning problems generated by a Python script. It covers a wide range of topics, including arithmetic, algebra, fractions, exponents, and word problems. It was created by DataMuncher-Labs and last updated on December 29, 2025.
Use Cases
- Training language models for arithmetic problem-solving using the 'add', 'sub', 'mul', and 'div' categories
- Benchmarking AI performance on algebraic equations using the 'linear_eq', 'two_step_eq', and 'quadratic' categories
- Developing educational tools for word problem comprehension using the 'word_algebra' category
- Testing symbolic reasoning capabilities on advanced topics using the 'system_3x3', 'diophantine', and 'log_eq' categories
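The category names in the use cases above suggest a simple way to split the data by task. The following is a minimal sketch, not the dataset's documented interface: the group labels (`arithmetic`, `algebra`, `word_problems`, `advanced`) and the `category` field name are assumptions, since the column names are not documented.

```python
from collections import defaultdict

# Category names come from the use cases above. The group labels and the
# "category" field name are assumptions made for illustration only.
USE_CASE_CATEGORIES = {
    "arithmetic": {"add", "sub", "mul", "div"},
    "algebra": {"linear_eq", "two_step_eq", "quadratic"},
    "word_problems": {"word_algebra"},
    "advanced": {"system_3x3", "diophantine", "log_eq"},
}


def split_by_use_case(records, category_field="category"):
    """Group dict records by use case according to their category value.

    Records whose category is missing or not listed above go to "other".
    """
    groups = defaultdict(list)
    for record in records:
        category = record.get(category_field)
        for use_case, categories in USE_CASE_CATEGORIES.items():
            if category in categories:
                groups[use_case].append(record)
                break
        else:
            # No use case matched this category.
            groups["other"].append(record)
    return dict(groups)
```

If the real field carrying the category has a different name, pass it through `category_field` once the schema has been inspected.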
Strengths
- Contains 7.5 million problems, providing a large-scale resource for training
- Covers a diverse set of mathematical topics as listed in the description, including arithmetic, algebra, and advanced equations
Limitations
- Column-level documentation is absent; field semantics must be inferred after download
- Row count is not confirmed by the metadata; the 7.5 million figure comes from the description and should be verified after download
- Description metadata is limited; actual data quality requires manual inspection after download
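Because field semantics and the row count have to be checked by hand, a quick profiling pass over a sample can help. This is a minimal sketch that assumes the downloaded file has already been loaded as an iterable of dict records; the file format is not stated, so loading is left out, and `summarize_fields` is a hypothetical helper rather than part of any dataset tooling.

```python
from collections import Counter, defaultdict
from itertools import islice


def summarize_fields(records, sample_size=1000):
    """Profile the first `sample_size` records of an iterable of dicts.

    Returns how many rows were sampled and, for each field seen, the
    value types observed and the first example value encountered.
    """
    type_counts = defaultdict(Counter)
    examples = {}
    rows_sampled = 0
    for record in islice(records, sample_size):
        rows_sampled += 1
        for field, value in record.items():
            type_counts[field][type(value).__name__] += 1
            # Keep only the first value seen for each field.
            examples.setdefault(field, value)
    return {
        "rows_sampled": rows_sampled,
        "fields": {
            field: {"types": dict(counts), "example": examples[field]}
            for field, counts in type_counts.items()
        },
    }
```

A field that shows more than one type, or appears in fewer rows than were sampled, is a reasonable place to start the manual inspection. Counting the full dataset requires iterating over every record rather than a sample.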
Provenance
- Source
  - DataMuncher-Labs
- Collection Method
  - Built via a Python script
- Freshness
  - Last updated 2025-12-29 22:53:58; freshness should be verified