Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
This dataset inherits from the original Anthropic/hh-rlhf collection and has been formatted using the OpenAI chat convention for Direct Preference Optimization (DPO) fine-tuning. Each conversational response has been labeled for safety using the LLaMa Guard model. The dataset was uploaded by author javirandor and last updated on March 28, 2025.
License information is unknown; terms of use for the derived dataset should be verified.