Name: DeepSeek-ProverBench: Formal Theorem Proving Data for Lean 4
Creator: deepseek-ai
Published: 2025-04-30T14:51:59
Keywords: Librarypolars, Size Categoriesn1 K, Modalitytext, Mathematics, Librarymlcroissant, Librarydatasets, Librarypandas, Lean 4, Text, Formal Methods, Regionus, Llm Training, Theorem Proving, JSON

Description

DeepSeek-ProverBench is a dataset for training and evaluating large language models on formal theorem proving in the Lean 4 environment. It was created by deepseek-ai using a recursive theorem proving pipeline powered by the DeepSeek-V3 model to decompose complex problems into subgoals. The dataset was last updated on April 30,我们发现了一个问题，输入中的描述是中文的，但输出要求是英文。根据指令，我需要将输入翻译成英文。让我重新处理。2025.

Use Cases

Fine-tuning language models for formal theorem proving based on the described Lean 4 proof data.
Benchmarking the reasoning capabilities of AI models based on the described theorem decomposition pipeline.
Studying automated proof generation strategies based on the described subgoal resolution process.

Strengths

Data was generated using a sophisticated pipeline powered by the DeepSeek-V3 model.
Specifically designed for the modern Lean 4 theorem proving environment.
Last updated on 2025-04-30, indicating recent maintenance.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.

Provenance

Source: deepseek-ai via Hugging Face.
Collection Method: Data collected through a recursive theorem proving pipeline powered by DeepSeek-V3.
Time Range: null
Freshness: Last updated 2025-04-30 14:54:40.
Geography: null

null

Text JSON Librarypolars Size Categoriesn1 K Modalitytext Mathematics Librarymlcroissant Librarydatasets Librarypandas Lean 4 Formal Methods Regionus Llm Training Theorem Proving

DeepSeek-ProverBench: Formal Theorem Proving Data for Lean 4

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info