This dataset contains 123 proof-improvement instances mined from code review on mathlib4, the mathematics library for the Lean 4 theorem prover. It was created by SJCaldwell to benchmark how well language models judge proof quality, as distinct from proof correctness. It was last updated in March 2026.
Full dataset details, including specific columns and license, are only available on the linked Hugging Face dataset page.