This dataset contains UMM results for the UEval benchmark: a collection of outputs from various multimodal and large language models. It was created by wenwenw945 and last updated on April 9, 2026. It includes results from models such as OmniGen2 and Emu3.5, configured for specific understanding tasks.
The license is unknown, so users should verify permissions before use. Some models, such as BLIP3O and TokenFlow, are excluded because they lack 'understanding' capability.