MMEB Eval: Massive Multimodal Embedding Benchmark with 36 Datasets

Name: MMEB Eval: Massive Multimodal Embedding Benchmark with 36 Datasets
Creator: TIGER-Lab
Published: 2024-10-08T00:40:40
Keywords: Size Categories: 10K<n<100K, Library: polars, Library: dask, Language: en, Modality: text, Ranking, Model Evaluation, Library: mlcroissant, Vision Language, Modality: image, Library: datasets, Benchmark, Computer Vision, Parquet, arXiv:2410.05160, Region: us, Large Scale, Multimodal Embeddings, License: apache-2.0, Multimodal

Description

A benchmark for evaluating multimodal embedding models, covering 4 meta tasks and 36 datasets. The dataset was created by TIGER-Lab and published in the paper 'VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks'. It was last updated on Hugging Face on October 28, 2024.

Use Cases

Benchmarking model performance on multimodal retrieval tasks across the 4 meta tasks.
Evaluating embedding quality across the 36 included vision-language datasets.
Training or fine-tuning multimodal models using the provided query-target evaluation examples.
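
As a rough illustration of the benchmarking use case, the sketch below scores a model on query-target ranking by checking whether the correct target is the top-ranked candidate. Everything in it is an assumption made for illustration: the card does not document the dataset's columns or the official metric, so the `(query_vec, candidate_vecs, correct_index)` layout, the cosine-similarity ranking, and the `precision_at_1` helper are hypothetical stand-ins rather than the benchmark's actual evaluation code.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def precision_at_1(examples):
    """Fraction of examples whose correct target is ranked first.

    `examples` is a list of (query_vec, candidate_vecs, correct_index) tuples.
    This layout is hypothetical; the real field names must be checked after download.
    """
    if not examples:
        raise ValueError("no examples to evaluate")
    hits = 0
    for query, candidates, correct in examples:
        scores = [cosine(query, c) for c in candidates]
        # Index of the highest-scoring candidate (first one wins on ties).
        best = max(range(len(scores)), key=scores.__getitem__)
        hits += int(best == correct)
    return hits / len(examples)


# Toy embeddings standing in for model outputs; not taken from the dataset.
toy = [
    ([1.0, 0.0], [[0.9, 0.1], [0.0, 1.0]], 0),              # correct target ranked first
    ([0.0, 1.0], [[1.0, 0.0], [0.2, 0.8]], 1),              # correct target ranked first
    ([1.0, 1.0], [[1.0, 0.0], [1.0, 1.0], [0.0, 1.0]], 0),  # candidate 1 outranks the target
]
score = precision_at_1(toy)
print(score)
```

In practice the vectors would come from the embedding model under test, and the score would be computed separately for each of the 36 datasets before averaging within each meta task.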

Strengths

Covers 4 meta tasks and 36 datasets for evaluation.
Provides 1000 evaluation examples per dataset.
Published in a peer-reviewed paper (VLM2Vec).

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
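The exact total row count is not stated. At 1000 evaluation examples per dataset across 36 datasets, it would be roughly 36,000 rows, which is consistent with the 10K<n<100K size category but should be confirmed after download.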

Provenance

Source: TIGER-Lab
Collection Method: Compiled from a set of evaluation tasks, as described in the associated paper.
Freshness: Last updated 2024-10-28 16:42:34; freshness should be verified.

License is not confirmed here. The keywords tag the dataset as apache-2.0, but users should verify the license on the dataset page before use.

Quality Score

Overall: D (36)
Description: 39
Source: 36
Reputation: 40
Access: 22

Community

3.4K downloads

12 likes

0 views

Dataset Info

Author: TIGER-Lab
Created: Oct 8, 2024
Updated: Oct 28, 2024
Last synced: Jun 16, 2026
