Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MM-RLHF is a project for aligning Multimodal Large Language Models with human preferences. The release includes a high-quality alignment dataset and a strong critique-based reward model. The project was open-sourced by yifanzhang114 in February 2025.
License is unknown; terms of use must be verified before application.