Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
GenAI-Bench is a benchmark for evaluating multimodal large language models' ability to judge the quality of AI-generated content. The dataset is based on human preference data collected via the GenAI Arena platform and is maintained by TIGER-Lab. It was last updated on 2024-09-08.
License is unknown; terms of use must be verified before application.