Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
This is the training split for the Massive Multimodal Embedding Benchmark (MMEB), used to train VLM2Vec models as described in an ICLR 2025 paper. It comprises data from 20 out of 36 datasets selected for evaluating multimodal embedding models across 4 meta tasks.
The full dataset description, including specific data details and license, is hosted externally at https://huggingface.co/datasets/TIGER-Lab/MMEB-train.