DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

IFDecorator: Synthetic Datasets for Instruction Following RL with Verifiable Rewards | DataSalon

Home Multimodal & LLMIFDecorator: Synthetic Datasets for Instruction Following RL with Verifiable Rewards

Multimodal & LLM

IFDecorator: Synthetic Datasets for Instruction Following RL with Verifiable Rewards

Name: IFDecorator: Synthetic Datasets for Instruction Following RL with Verifiable Rewards
Creator: guox18
Published: 2025-08-06T09:52:24
Keywords: Text, Ai Training, Reinforcement Learning, Synthetic Data, Verifiable Rewards, Instruction Following, Synthetic

by guox18·Updated 11mo ago

Available on 1 platform

Description

3,625 training and 200 validation examples engineered for Reinforcement Learning with Verifiable Rewards (RLVR). The dataset, created by guox18, contains two complementary synthetic datasets with different synthesis approaches and difficulty distributions. It was last updated on August 8, 2025.

Use Cases

Training instruction-following agents based on the described high-quality synthetic data.
Benchmarking RLVR algorithms based on the controlled difficulty distributions mentioned.
Studying the impact of different synthetic data generation approaches on RL performance.

Strengths

Contains 3,825 total examples (3,625 training + 200 validation).
Features controlled difficulty distributions as described.
Comprises two complementary datasets with different synthesis approaches.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count for the full dataset is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface user guox18
Collection Method: Synthetic data engineering for RLVR, as described.
Freshness: Last updated 2025-08-08 08:46:53.

License is unknown; terms of use must be verified.

Text Ai Training Reinforcement Learning Synthetic Data Verifiable Rewards Instruction Following Synthetic

Related Datasets

Quality Score

C40

Description

Source

Reputation

Quality Score

C40

Description

Source

Reputation

Access

Community

217 downloads

2 likes

0 views

Dataset Info

Author: guox18
Created: Aug 6, 2025
Updated: Aug 8, 2025
Last synced: May 27, 2026

Access

Community

217 downloads

2 likes

0 views

Dataset Info

Author: guox18
Created: Aug 6, 2025
Updated: Aug 8, 2025
Last synced: May 27, 2026

IFDecorator: Synthetic Datasets for Instruction Following RL with Verifiable Rewards

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info