Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Sft Hh Rlhf is a dataset published on Hugging Face by Dahoas, with its last update recorded on 2022-12-22. The title suggests it contains data related to reinforcement learning from human feedback (RLHF) and supervised fine-tuning (SFT). The dataset's specific content, scale, and structure require verification after download.
License is unknown; users must verify permissions before use.