DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Skywork-OR1-RL-Data: 100K-1M RL Problems with 0-16 Difficulty Levels | DataSalon

Home Multimodal & LLMSkywork-OR1-RL-Data: 100K-1M RL Problems with 0-16 Difficulty Levels

Multimodal & LLM

Skywork-OR1-RL-Data: 100K-1M RL Problems with 0-16 Difficulty Levels

Name: Skywork-OR1-RL-Data: 100K-1M RL Problems with 0-16 Difficulty Levels
Creator: Skywork
Published: 2025-04-12T10:01:22
Keywords: Librarypolars, Librarydask, Modalitytext, Size Categories100 Kn1 M, Librarymlcroissant, Arxiv250522312, Librarydatasets, Parquet, Regionus

by Skywork·Updated 1y ago

Available on 1 platform

Description

Skywork-OR1-RL-Data is a reinforcement learning training dataset containing between 100,000 and 1,000,000 text records released by Skywork in April 2025. The collection features problems categorized by difficulty levels ranging from 0 to 16, calibrated against specific DeepSeek-R1-Distill-Qwen model variants.

Use Cases

RL training of LLMs using the 0-16 difficulty level labels to curate curriculum learning
Benchmarking model distillation by comparing performance against the DeepSeek-R1-Distill-Qwen difficulty baselines
Analyzing model failure modes on specific difficulty tiers identified in the metadata

Strengths

100,000 to 1,000,000 records
Granular difficulty scoring on a 0-16 scale
Calibrated against specific DeepSeek-R1-Distill-Qwen model variants

Limitations

Difficulty scores are model-dependent and may not reflect absolute difficulty for non-Qwen architectures
Intentional exclusion of extreme difficulty levels (0 and 16) creates a truncated dataset distribution

Provenance

Source: Skywork
Collection Method: Curated problems filtered by model-based difficulty assessment relative to DeepSeek-R1-Distill-Qwen
Freshness: Last updated May 29, 2025.

Difficulty filtering is specific to DeepSeek-R1-Distill-Qwen-1.5B, 7B, and 32B; refer to Arxiv 250522312 for the full methodology.

Parquet Librarypolars Librarydask Modalitytext Size Categories100 Kn1 M Librarymlcroissant Arxiv250522312 Librarydatasets Regionus

Related Datasets

Quality Score

D40

Description

Source

Reputation

Quality Score

D40

Description

Source

Reputation

Access

Community

693 downloads

64 likes

0 views

Dataset Info

Author: Skywork
Created: Apr 12, 2025
Updated: May 29, 2025
Last synced: Jul 25, 2026

Access

Community

693 downloads

64 likes

0 views

Dataset Info

Author: Skywork
Created: Apr 12, 2025
Updated: May 29, 2025
Last synced: Jul 25, 2026

Skywork-OR1-RL-Data: 100K-1M RL Problems with 0-16 Difficulty Levels

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info