Available on 1 platform
Psychology RLHF data was used to train a LLaMA-7B reward model. The dataset was uploaded by author 'samhog' to Hugging Face on July 17, 2023. Its specific content, size, and structure are not detailed in the provided metadata.
The license is unknown, which may restrict commercial or research use.