MME-CoT: Benchmarking Chain-of-Thought Reasoning in Multimodal Models | DataSalon

Multimodal & LLM

Name: MME-CoT: Benchmarking Chain-of-Thought Reasoning in Multimodal Models
Creator: CaraJ
Published: 2025-02-08T17:51:41
Keywords: Chain Of Thought, Benchmark, Reasoning Evaluation, Large Language Models, Multimodal Benchmark, Multimodal

Available on 1 platform

Description

MME-CoT is a benchmark dataset for evaluating Chain-of-Thought (CoT) reasoning in Large Multimodal Models. It was created by CaraJ and published on Hugging Face, and was last updated on 2025-03-19. The benchmark assesses three aspects of reasoning: quality, robustness, and efficiency.

Use Cases

Benchmarking the quality of Chain-of-Thought reasoning in multimodal models
Assessing the robustness of model reasoning across different reasoning tasks
Evaluating the efficiency of reasoning processes in large multimodal models
Training or fine-tuning models to improve multimodal reasoning performance

Strengths

Dataset is associated with a published paper and a project page with visualization tools
Updated on 2025-03-19, after its initial publication on 2025-02-08, indicating maintenance following release

Limitations

Description metadata is limited; actual data quality requires manual inspection after download
Row count is unknown, which makes it harder to assess suitability before download
Column-level documentation is absent; field semantics must be inferred after download
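
Because the row count and column semantics are not documented, a first inspection pass after download can help. The following is a minimal sketch, assuming the downloaded records are available as an iterable of dict-like rows (for example, by iterating over a split loaded with the Hugging Face `datasets` library). The helper name `profile_records` and the sample field names (`question`, `answer`, `image`) are hypothetical and are not taken from the actual MME-CoT schema.

```python
from collections import Counter


def profile_records(records):
    """Return the row count plus, per column, the observed Python types
    and the number of rows in which the column is absent."""
    rows = list(records)
    columns = {}
    for row in rows:
        for name, value in row.items():
            stats = columns.setdefault(name, {"types": Counter(), "present": 0})
            stats["present"] += 1
            stats["types"][type(value).__name__] += 1
    return {
        "row_count": len(rows),
        "columns": {
            name: {
                "types": dict(stats["types"]),
                # "missing" counts rows that lack the key entirely;
                # explicit None values are reported under the NoneType type.
                "missing": len(rows) - stats["present"],
            }
            for name, stats in columns.items()
        },
    }


# Hypothetical sample rows; the real column names must be checked after download.
sample = [
    {"question": "What is shown?", "answer": "A", "image": b"\x89PNG"},
    {"question": "How many?", "answer": None, "image": b"\x89PNG"},
    {"question": "Which one?", "answer": "C"},
]

summary = profile_records(sample)
print("rows:", summary["row_count"])
for name, info in summary["columns"].items():
    print(name, info)
```

A summary like this only reports what is observed in the rows; it does not replace reading the associated paper or project page to learn what each field means.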

Provenance

Source: CaraJ
Freshness: Last updated 2025-03-19 07:18:26

Quality Score

D38

Description

Source

Reputation

Community

909 downloads

22 likes

0 views

Dataset Info

Author: CaraJ
Created: Feb 8, 2025
Updated: Mar 19, 2025
Last synced: Jul 16, 2026
