Nemotron-RL-Super: Training Blends for Nemotron-3-Super-120B-A12B | DataSalon

Multimodal & LLM

Name: Nemotron-RL-Super: Training Blends for Nemotron-3-Super-120B-A12B
Creator: nvidia
Published: 2026-03-11T03:32:01
Keywords: License CC BY 4.0, Region US

Available on 1 platform

Description

NVIDIA released this collection of dataset blends in March 2026 to document the specific data mixtures used for reinforcement learning (RL) training of the Nemotron-3-Super-120B-A12B model. The data is organized into six distinct training stages, including Reinforcement Learning from Verifiable Rewards (RLVR), Software Engineering (SWE), and Reinforcement Learning from Human Feedback (RLHF).

Use Cases

Replicating RL training stages for large language models using the specified mixing ratios
Analyzing the composition of SWE 1 and SWE 2 stages for software engineering tasks
Evaluating the impact of RLVR (Verifiable Rewards) data on model performance
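
As a rough illustration of the first use case, the sketch below turns a set of mixing ratios into per-component sample counts and draws a shuffled blend. It is only a minimal sketch under stated assumptions: the component names, the ratios, and the helper functions (normalize_ratios, allocate_counts, build_blend) are hypothetical and are not taken from the NVIDIA release. The actual stage names and percentages should be read from the source description.

```python
import random
from typing import Dict, List


def normalize_ratios(ratios: Dict[str, float]) -> Dict[str, float]:
    """Scale non-negative weights so that they sum to 1.0."""
    if any(weight < 0 for weight in ratios.values()):
        raise ValueError("mixing ratios must be non-negative")
    total = sum(ratios.values())
    if total <= 0:
        raise ValueError("mixing ratios must sum to a positive value")
    return {name: weight / total for name, weight in ratios.items()}


def allocate_counts(ratios: Dict[str, float], n_samples: int) -> Dict[str, int]:
    """Split n_samples across components using largest-remainder rounding."""
    probs = normalize_ratios(ratios)
    exact = {name: p * n_samples for name, p in probs.items()}
    counts = {name: int(value) for name, value in exact.items()}
    leftover = n_samples - sum(counts.values())
    # Hand the remaining samples to the components with the largest fractional parts.
    by_remainder = sorted(exact, key=lambda name: exact[name] - counts[name], reverse=True)
    for name in by_remainder[:leftover]:
        counts[name] += 1
    return counts


def build_blend(
    components: Dict[str, List[str]],
    ratios: Dict[str, float],
    n_samples: int,
    seed: int = 0,
) -> List[str]:
    """Draw a shuffled blend whose composition follows the given ratios."""
    rng = random.Random(seed)
    counts = allocate_counts(ratios, n_samples)
    blend: List[str] = []
    for name, k in counts.items():
        # Sampling with replacement lets a small component still meet its quota.
        blend.extend(rng.choices(components[name], k=k))
    rng.shuffle(blend)
    return blend


if __name__ == "__main__":
    # Hypothetical components and ratios, for illustration only.
    example_components = {
        "component_a": ["a-0", "a-1", "a-2"],
        "component_b": ["b-0", "b-1"],
        "component_c": ["c-0"],
    }
    example_ratios = {"component_a": 5, "component_b": 3, "component_c": 2}
    print(allocate_counts(example_ratios, 10))
    print(build_blend(example_components, example_ratios, 10))
```

Largest-remainder rounding is used here so that the per-component counts always add up to the requested total. Sampling with replacement is a simplification, and a faithful replication might instead sample without replacement or stream from the component datasets.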

Strengths

Provides exact mixing ratios for six distinct RL training stages
Covers specialized domains including Software Engineering (SWE) and Verifiable Rewards (RLVR)
Released by NVIDIA to provide transparency for Nemotron-3-Super-120B-A12B training

Limitations

Missing raw row counts and specific column schemas in the provided metadata
Excludes 'additional data' mentioned in the source description
High dependency on external component datasets for full replication

Provenance

Source: NVIDIA
Collection Method: Curated blends of existing datasets
Freshness: Last updated March 2026.
Geography: United States

Users should refer to the full description on the Hugging Face page for specific mixing percentages; the dataset is licensed under CC BY 4.0.

Quality Score

D38

Description

Source

Reputation

Access

Community

983 downloads

24 likes

0 views

Dataset Info

Author: nvidia
Created: Mar 11, 2026
Updated: Mar 12, 2026
Last synced: Jun 11, 2026
