Open-Omega-Forge-1M: A Compact Reasoning Dataset for Mathematics, Science, and Coding

Name: Open-Omega-Forge-1M: A Compact Reasoning Dataset for Mathematics, Science, and Coding
Creator: prithivMLmods
Published: 2025-07-14T16:22:09
Keywords: Training Data, Mathematics, Benchmark, Text, Reasoning, Coding, Science

by prithivMLmodsUpdated 4mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

Open-Omega-Forge-1M is a curated collection derived from multiple high-quality datasets, designed to enhance reasoning capabilities. The dataset is a focused subset intended to maintain quality and diversity while providing a more manageable size for training and evaluation. It was created by prithivMLmods and last updated on 2026-03-09.

Use Cases

Training language models for mathematical problem-solving based on the described reasoning patterns.
Benchmarking model performance on scientific reasoning tasks as indicated by the dataset's scope.
Fine-tuning code generation models using the coding-related content mentioned in the description.
Evaluating the generalization of reasoning models across domains as suggested by the dataset's curation.

Strengths

Derived from multiple high-quality source datasets.
Designed to be a more manageable size for training and evaluation.
Curated to maintain quality and diversity of reasoning patterns.
Last updated on 2026-03-09.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: prithivMLmods
Collection Method: Curated and optimized collection derived from multiple high-quality datasets.
Freshness: Last updated 2026-03-09 06:03:46.

License is unknown.

Text Training Data Mathematics Benchmark Reasoning Coding Science

Related Datasets

Quality Score

C40

Description

42

Source

39

Reputation

49

Access

26

Community

587 downloads

7 likes

0 views

Dataset Info

Author: prithivMLmods
Created: Jul 14, 2025
Updated: Mar 9, 2026
Last synced: Jun 4, 2026

Access

26

Community

587 downloads

7 likes

0 views

Dataset Info

Author: prithivMLmods
Created: Jul 14, 2025
Updated: Mar 9, 2026
Last synced: Jun 4, 2026

Open-Omega-Forge-1M: A Compact Reasoning Dataset for Mathematics, Science, and Coding

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info