Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
GSM8K_zh contains 8,792 Chinese question-answer pairs for mathematical reasoning, translated from the English GSM8K dataset. It is split into 7,473 training samples and 1,319 testing samples for supervised fine-tuning and evaluation. The dataset was created by author ximzdawn using GPT-3.5-Turbo.
Testing samples may only contain questions (question_zh) without answers, as suggested by the description. Users should verify the exact structure and license on the dataset page.