University-Level Math Reasoning Problems With Grading Rubrics

Name: University-Level Math Reasoning Problems With Grading Rubrics
Creator: BakeAI
Published: 2026-03-11T22:55:00
Keywords: Task Categories: Text Generation, Question Answering; Libraries: polars, mlcroissant, datasets, pandas; Language: en; Size Category: n<1K; Modality: text; Region: US; License: Apache 2.0; Education, Reasoning, Reinforcement Learning, JSON, Math, Reward Model

Description

BakeAI's preview dataset contains 50 challenging university-level mathematics reasoning problems. Each problem includes a detailed reference solution, a structured grading rubric, and an anonymized model evaluation result.

Use Cases

Benchmark model performance on multi-step reasoning problems using the structured grading rubric for evaluation.
Analyze frontier model attempts against point-by-point grading criteria to identify failure modes in complex computation.
Train or fine-tune models for proof construction tasks using the provided reference solutions as supervision.
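
The rubric-based benchmarking use case above can be sketched in a few lines of Python. This is a minimal illustration, not the dataset's actual schema: the `RubricItem` class, its field names, the example criteria, and the awarded points are all hypothetical, since the listing does not document the record format.

```python
from dataclasses import dataclass


@dataclass
class RubricItem:
    # Hypothetical fields; the real dataset may name or structure these differently.
    description: str
    max_points: float


def score_attempt(rubric, awarded):
    """Return (earned, possible, fraction) for one graded attempt.

    `awarded` holds the points given for each rubric item, in order.
    Each award is clamped to the range [0, max_points] for its item.
    """
    if len(rubric) != len(awarded):
        raise ValueError("need exactly one awarded score per rubric item")
    earned = 0.0
    for item, points in zip(rubric, awarded):
        earned += min(max(points, 0.0), item.max_points)
    possible = sum(item.max_points for item in rubric)
    fraction = earned / possible if possible else 0.0
    return earned, possible, fraction


# Made-up rubric and grades, for illustration only.
rubric = [
    RubricItem("Sets up the integral correctly", 2),
    RubricItem("Evaluates the integral", 3),
    RubricItem("States the final answer", 1),
]
awarded = [2, 1.5, 0]
earned, possible, fraction = score_attempt(rubric, awarded)
print(f"{earned}/{possible} points ({fraction:.0%})")
```

Averaging `fraction` over all problems gives one possible benchmark score; comparing per-item awards across problems is one way to look for recurring failure modes.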

Strengths

Contains 50 challenging, university-level problems designed for multi-step reasoning.
Each entry includes a detailed reference solution and a structured point-by-point grading rubric.
Provides anonymized model evaluation results against the rubric for benchmarking.

Limitations

Small sample size of 50 problems limits statistical power for broad model evaluation.
Preview nature suggests the dataset may be incomplete or a subset of a larger collection.
Geographic or topical bias is possible, since the region is listed as US and the focus is restricted to university-level math.
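
To make the sample-size limitation concrete, the sketch below computes a Wilson score interval for a pass rate measured on 50 problems. The figure of 30 solved problems is a made-up example, not a result from this dataset, and treating each problem as an independent pass/fail trial is a simplifying assumption.

```python
import math


def wilson_interval(successes, n, z=1.96):
    """Wilson score interval for a binomial proportion (z=1.96 is about 95%)."""
    if n <= 0:
        raise ValueError("n must be positive")
    p = successes / n
    denom = 1 + z * z / n
    center = (p + z * z / (2 * n)) / denom
    half = z * math.sqrt(p * (1 - p) / n + z * z / (4 * n * n)) / denom
    return center - half, center + half


# Hypothetical outcome: a model fully solves 30 of the 50 problems.
low, high = wilson_interval(30, 50)
print(f"observed 60%, 95% interval roughly {low:.2f} to {high:.2f}")
```

Under these assumptions the interval spans roughly 0.46 to 0.72, so two models whose pass rates differ by ten percentage points could not be reliably separated on 50 problems alone.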

Provenance

Source: BakeAI
Collection Method: not specified
Time Range: not specified
Freshness: not specified
Geography: United States (based on the "Region: US" tag)

Quality Score

Overall: D (36)
Description: 39
Source: 36
Reputation: 38
Access: 22

Community

6 downloads
1 like
0 views

Dataset Info

Author: BakeAI
Created: Mar 11, 2026
Updated: Mar 11, 2026
