Name: Nemotron Cascade RL: 108,938 Prompts for Instruction-Following Reinforcement Learning
Creator: nvidia
Published: 2025-12-15T23:16:44
Keywords: Librarypolars, Training Data, Languageen, Prompt Engineering, Modalitytext, Size Categories100 Kn1 M, Librarymlcroissant, Librarydatasets, Librarypandas, Text, Parquet, Language Model, Regionus, Reinforcement Learning, Licenseodc By, Instruction Following

Description

NVIDIA's Nemotron-Cascade-RL-IF-RL dataset contains 108,938 samples designed for Instruction-Following Reinforcement Learning (IF-RL). The dataset includes prompts and associated metadata to improve language models' instruction-following capability and is ready for commercial use with attribution. It was last updated on December 16, III.

Use Cases

Training reinforcement learning agents for instruction-following based on the provided prompts and metadata.
Fine-tuning large language models to improve their response alignment with human instructions.
Benchmarking the performance of language models on instruction-following tasks.
Developing reward models for reinforcement learning from human feedback (RLHF) workflows.

Strengths

Contains 108,938 training samples, providing a substantial base for model training.
Explicitly stated as ready for commercial use, clarifying licensing for practitioners.
Created by NVIDIA, a leading institution in AI hardware and software development.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count for the full dataset is unknown, which may limit suitability assessment.
Data may reflect source bias inherent to the specific prompts and metadata collected.

Provenance

Source: nvidia
Collection Method: Likely generated or curated for training reinforcement learning agents on instruction-following tasks.
Time Range: null
Freshness: Last updated 2025-12-16 06:15:47; freshness should be verified.
Geography: null

null

Text Parquet Librarypolars Training Data Languageen Prompt Engineering Modalitytext Size Categories100 Kn1 M Librarymlcroissant Librarydatasets Librarypandas Language Model Regionus Reinforcement Learning Licenseodc By Instruction Following

Nemotron Cascade RL: 108,938 Prompts for Instruction-Following Reinforcement Learning

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info