This dataset contains 580 Lean-formalized STEM problem-solving examples spanning physics, chemistry, calculus, and probability. It was created by anon-ed-2026, is hosted on Hugging Face, and was last updated on May 4, 2026.
Use Cases
- Training theorem-proving AI models on Lean-formalized STEM problems.
- Benchmarking automated reasoning systems on examples from physics, chemistry, calculus, and probability.
- Developing educational tools for interactive STEM learning built on formalized problem-solving examples.
- Studying the structure and complexity of formal proofs in various scientific domains.
Strengths
- Contains 580 examples, providing a substantial collection for training or benchmarking.
- Covers multiple STEM domains including physics, chemistry, calculus, and probability.
- Examples are formalized in Lean, a language for interactive theorem proving.
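For readers unfamiliar with Lean, the block below gives a rough idea of what a formalized STEM statement can look like. It is a hypothetical illustration written for this card, not an excerpt from the dataset; the names `distance` and `distance_example` are invented, and the dataset's actual Lean version, imports, and problem style are not documented here.

```lean
-- Hypothetical illustration only; not taken from the dataset.
-- Distance covered at constant speed, in whole metres and seconds.
def distance (speed time : Nat) : Nat := speed * time

-- A body moving at 3 m/s for 4 s covers 12 m.
theorem distance_example : distance 3 4 = 12 := rfl
```

Real entries are likely to be more involved (for example, using real numbers and a mathematics library), so this should be read only as a sketch of the general shape: a definition or hypothesis, a stated claim, and a machine-checked proof.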
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- The row count (580) is known, but the structure and content of individual examples are not documented.
- Description metadata is limited; actual data quality requires manual inspection after download.
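Because the columns are undocumented, a first pass after download usually means listing the field names and the value types that actually occur. The following is a minimal sketch of that step, assuming the rows have already been loaded as a list of dictionaries. The helper `summarize_fields` and the sample rows (including the field names `domain`, `statement`, and `difficulty`) are hypothetical placeholders, not the dataset's real schema.

```python
from collections import defaultdict


def summarize_fields(rows):
    """Return, for each field name, the value types seen and how many rows contain it."""
    types = defaultdict(set)
    counts = defaultdict(int)
    for row in rows:
        for key, value in row.items():
            types[key].add(type(value).__name__)
            counts[key] += 1
    return {
        key: {"types": sorted(types[key]), "present_in": counts[key]}
        for key in types
    }


# Hypothetical rows for illustration; the real field names are unknown.
sample_rows = [
    {"domain": "physics", "statement": "<lean source>", "difficulty": 2},
    {"domain": "calculus", "statement": "<lean source>", "difficulty": None},
]

summary = summarize_fields(sample_rows)
for name, info in summary.items():
    print(name, info)
```

Fields that report more than one type, or that are present in fewer rows than the total, are good candidates for closer manual inspection.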
Provenance
- Source: Hugging Face dataset repository by anon-ed-2026.
- Freshness: Last updated 2026-05-04 06:09:48; check the repository for later revisions before relying on this date.