Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Anthropic Hh Rlhf Preprocessed is a dataset published on huggingface by TheHassanSaud. The title suggests it contains preprocessed data from Anthropic's 'HH' (Helpful and Harmless) project, likely used for Reinforcement Learning from Human Feedback (RLHF). The dataset was last updated on 2026-04-24 18:40:45.
License is unknown; users must verify terms before commercial or research use.