Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A collection of 45,882 prompt samples designed for Reinforcement Learning from Human Feedback training. Created by NVIDIA, this dataset supports language model alignment and was last updated in December 2025.
The full description, including specific columns and license details, is hosted externally on Hugging Face.