DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Math SFT Solutions No CoT V2: A Cleaned Instruction Tuning Dataset | DataSalon

Home NLP & TextMath SFT Solutions No CoT V2: A Cleaned Instruction Tuning Dataset

NLP & Text

Math SFT Solutions No CoT V2: A Cleaned Instruction Tuning Dataset

Name: Math SFT Solutions No CoT V2: A Cleaned Instruction Tuning Dataset
Creator: kaushik-harsh-99
Published: 2026-06-07T04:16:39
Keywords: Mathematics, Text, Language Model, Instruction Tuning, Supervised Fine Tuning, Synthetic

by kaushik-harsh-99·Updated 26d ago

Available on 1 platform

Description

A cleaned mathematical supervised fine-tuning dataset designed for instruction tuning and mathematical capability adaptation. The dataset introduces a simplified instruction–response format and removes intermediate reasoning contamination. It was created by author kaushik-harsh-99 and was last updated on 2026-06-07.

Use Cases

Instruction tuning of language models based on the simplified instruction–response format.
Adapting models for mathematical capability based on the dataset's mathematical content.
Training models on GSM8K-style problems using the augmented mathematical responses mentioned.
Evaluating model performance on tasks without chain-of-thought reasoning contamination.

Strengths

Dataset is specifically cleaned for supervised fine-tuning, removing intermediate reasoning contamination.
Version 2 introduces a simplified instruction–response format.
Includes augmented mathematical responses generated over GSM8K-style problems.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface
Freshness: Last updated 2026-06-07 04:28:17; freshness should be verified.

Text Mathematics Language Model Instruction Tuning Supervised Fine Tuning Synthetic

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

6 downloads

3 likes

0 views

Dataset Info

Author: kaushik-harsh-99
Created: Jun 7, 2026
Updated: Jun 7, 2026
Last synced: Jun 14, 2026

Access

Community

6 downloads

3 likes

0 views

Dataset Info

Author: kaushik-harsh-99
Created: Jun 7, 2026
Updated: Jun 7, 2026
Last synced: Jun 14, 2026

Math SFT Solutions No CoT V2: A Cleaned Instruction Tuning Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info