Nemotron-RL-Math-v2 is a small, curated set of mathematical problems selected for reinforcement learning workflows. Created by NVIDIA, the dataset is designed for Reinforcement Learning from Verifiable Rewards (RLVR). Problems are sourced from AoPS and StackExchange-derived math data that was held out from the Nemotron-SFT-Math-v4 SFT set.
The license is unknown; verify the terms of use before using the dataset.
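To illustrate what a verifiable reward can look like for math problems of this kind, here is a minimal sketch. It is not the dataset's own verifier, which this page does not describe. It assumes the model writes its final answer inside `\boxed{...}` and that each problem has a single reference answer string; the function names `extract_final_answer`, `normalize`, and `verifiable_reward` are hypothetical.

```python
import re
from fractions import Fraction
from typing import Optional, Union


def extract_final_answer(response: str) -> Optional[str]:
    """Return the content of the last \\boxed{...} in the response, if any.

    Assumption: answers are boxed and contain no nested braces.
    """
    matches = re.findall(r"\\boxed\{([^{}]*)\}", response)
    return matches[-1].strip() if matches else None


def normalize(answer: str) -> Union[Fraction, str]:
    """Map an answer to a comparable form.

    Numeric answers such as "0.5" and "1/2" become the same Fraction;
    anything else is compared as a whitespace-free string.
    """
    text = "".join(answer.split())
    try:
        return Fraction(text)
    except (ValueError, ZeroDivisionError):
        return text


def verifiable_reward(response: str, reference: str) -> float:
    """Binary reward: 1.0 if the boxed answer matches the reference, else 0.0."""
    predicted = extract_final_answer(response)
    if predicted is None:
        return 0.0
    return 1.0 if normalize(predicted) == normalize(reference) else 0.0
```

The reward is binary and computed by a deterministic check rather than a learned model, which is the sense in which it is "verifiable". A practical verifier would likely need symbolic equivalence checking for algebraic answers; the string comparison here is only a placeholder.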