The MiniF2F dataset contains mathematical problems from sources like AMC competitions paired with their formal statements in the Lean theorem prover format. It was created by Tonic and last updated on 2025-02-05. Each example includes both informal mathematical statements and their corresponding formal representations.
Use Cases
- Benchmarking theorem-proving AI models based on the collection of formalized competition problems.
- Training models for informal-to-formal statement translation based on the paired problem statements.
- Studying the formalization process in Lean based on the provided examples.
- Developing educational tools for formal mathematics based on the competition problem sources.
Strengths
- Pairs informal mathematical statements with their formal Lean counterparts, providing a direct mapping.
- Sources problems from established competitions like AMC, likely ensuring a standard of quality.
- Last updated on 2025-02-05, indicating recent maintenance.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- Various mathematical sources, including AMC competitions.
- Collection Method
- Collection and formalization of problems into Lean.
- Freshness
- Last updated 2025-02-05 10:29:25; freshness should be verified.