This dataset contains 2,952 declarations extracted from Coq source files that formalize category theory without axioms. It is sourced from the GitHub repository https://github.com/jwiegley/category-theory and was last updated on January 13, 2026. It was authored by phanerozoic.
Use Cases
- Training language models on formal proofs, using the extracted declarations as the corpus.
- Fine-tuning theorem provers on the structured formalization.
- Retrieval-augmented generation for proof assistants, with the declarations as the retrieval corpus.
- Learning proof embeddings and representations from the formal category theory statements.
Strengths
- Contains 2,952 formal declarations, providing a substantial corpus of structured mathematical statements.
- The formalization is axiom-free, so its results rest on Coq's core logic alone rather than on added assumptions. This matters for applications that need constructive or assumption-free proofs.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
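- The row count of the published files is not stated. Whether it matches the 2,952 declarations should be confirmed after download, since suitability is hard to assess without it.
- Last updated 2026-01-13 23:24:21 (time zone not stated); freshness should be verified against the upstream repository.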
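Because the fields are undocumented, a first pass after download might count the rows and tally which fields appear, with what types, and with what sample values. The following is a minimal sketch of that inspection, not part of the dataset's tooling. The field names (`name`, `kind`, `text`, `module`) and the sample rows are hypothetical placeholders; the real rows would come from whatever file format the dataset ships in.

```python
from collections import Counter, defaultdict


def summarize_fields(records, max_examples=2):
    """Count rows and, per field, record coverage, value types, and examples."""
    summary = defaultdict(
        lambda: {"count": 0, "types": Counter(), "examples": []}
    )
    total = 0
    for record in records:
        total += 1
        for key, value in record.items():
            info = summary[key]
            info["count"] += 1
            info["types"][type(value).__name__] += 1
            # Keep a few distinct sample values to help guess field meaning.
            if len(info["examples"]) < max_examples and value not in info["examples"]:
                info["examples"].append(value)
    return total, dict(summary)


# Hypothetical rows standing in for the downloaded data (names are assumptions).
sample = [
    {"name": "example_decl_1", "kind": "Definition", "text": "placeholder body"},
    {"name": "example_decl_2", "kind": "Lemma", "text": "placeholder body", "module": "Example"},
]

total, summary = summarize_fields(sample)
print(f"rows: {total}")
for field, info in summary.items():
    coverage = info["count"] / total
    print(field, f"coverage={coverage:.0%}", dict(info["types"]), info["examples"])
```

Running the same function over the real rows gives the actual row count, which can then be compared with the 2,952 declarations stated above. Fields with low coverage are likely optional.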
Provenance
- Source
  - Repository: https://github.com/jwiegley/category-theory
- Collection Method
  - Extracted from Coq source files.
- Freshness
  - Last updated 2026-01-13 23:24:21.
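The card does not describe how the declarations were extracted. As an illustration only, the sketch below shows one naive way to pull declaration keywords and names out of Coq source text with a regular expression. The keyword list, the `extract_declarations` helper, and the sample source are all assumptions made for this example. The pattern ignores comments, notations, and non-ASCII identifiers, so it should not be read as the method used to build this dataset.

```python
import re

# A partial, assumed list of Coq vernacular keywords that introduce a named declaration.
KEYWORDS = (
    "Definition", "Fixpoint", "Lemma", "Theorem", "Corollary",
    "Instance", "Class", "Record", "Inductive",
)

# Optional modifiers, then a keyword, then an ASCII identifier.
DECL_RE = re.compile(
    r"^[ \t]*(?:(?:Global|Local|Program)[ \t]+)*"
    r"(" + "|".join(KEYWORDS) + r")[ \t]+"
    r"([A-Za-z_][A-Za-z0-9_']*)",
    re.MULTILINE,
)


def extract_declarations(source):
    """Return (keyword, name) pairs for declarations found at line starts."""
    return [(m.group(1), m.group(2)) for m in DECL_RE.finditer(source)]


# Hypothetical Coq snippet written for this example, not taken from the repository.
sample_source = """
Definition compose_twice (f : nat -> nat) (x : nat) : nat := f (f x).

Lemma compose_twice_id : forall x, compose_twice (fun y => y) x = x.
Proof. reflexivity. Qed.

Local Definition helper : nat := 0.
"""

declarations = extract_declarations(sample_source)
for keyword, name in declarations:
    print(keyword, name)
```

A production extractor would more likely rely on Coq's own parser or tooling, since a line-based regular expression can misfire on text inside comments or strings.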