Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
RLHF-V-Dataset is a large-scale multimodal feedback dataset constructed using open-source models for reinforcement learning. It was released by the openbmb organization in May 2024 and has been utilized in models like MiniCPM-V 2.0. The dataset is designed for diverse tasks involving computer vision and large language models.
The dataset page mentions a related 'RLAIF-V-Dataset' release; users should verify which dataset suits their needs. The full description is hosted externally on Hugging Face, requiring a visit for complete details.