Sign in to view source links and access this dataset
Description
A dataset from the reasoning-core organization, last updated on September 18, 2025, hosted on Hugging Face. It contains problems for evaluating mathematical reasoning and automated theorem proving, likely sourced from the TPTP (Thousands of Problems for Theorem Provers) library. The dataset includes a test split intended for benchmarking model performance.
Use Cases
Benchmarking large language models on formal mathematical reasoning tasks based on the described evaluation context.
Training models for automated theorem proving based on problems from the TPTP library.
Evaluating the logical deduction capabilities of AI systems based on the dataset's focus on mathematical reasoning.
Strengths
Dataset is hosted on Hugging Face, facilitating easy access and integration with common ML tools.
Last updated timestamp is explicitly provided: 2025-09-18 09:26:08.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
Source
reasoning-core organization
Freshness
Last updated 2025-09-18 09:26:08.
License is unknown; terms of use must be verified before application.