MWS Vision Bench: Russian Business OCR Benchmark for Multimodal LLMs

Name: MWS Vision Bench: Russian Business OCR Benchmark for Multimodal LLMs
Creator: MTSAIR
Published: 2025-10-08T14:00:55
Keywords: Russian Language, Multimodal Llm, Benchmark, Computer Vision, Business Documents, Ocr Benchmark, Multimodal

by MTSAIRUpdated 3mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

MWS Vision Bench is the first Russian-language business OCR benchmark designed for multimodal large language models. The validation split is publicly available for open evaluation and comparison, with a paper expected soon. The dataset was uploaded by MTSAIR and last updated on March 11, -2026.

Use Cases

Benchmarking OCR performance of multimodal LLMs based on Russian business documents.
Comparing model performance across languages using the provided English and Chinese question configs.
Training models for business document understanding based on the multimodal benchmark structure.

Strengths

Designed as the first Russian-language business OCR benchmark for MLLMs.
Provides three language configurations (Russian, English, Chinese) for the same validation split.
Publicly available validation split intended for open evaluation and comparison.

Limitations

Row count, column names, and file formats are unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: MTSAIR
Collection Method: Likely contains annotated business document images and associated questions for benchmark tasks.
Time Range: null
Freshness: Last updated 2026-03-11 10:44:06; freshness should be verified.
Geography: null

null

Multimodal Russian Language Multimodal Llm Benchmark Computer Vision Business Documents Ocr Benchmark

Related Datasets

Quality Score

C41

Description

42

Source

39

Reputation

51

Access

26

Community

386 downloads

17 likes

0 views

Dataset Info

Author: MTSAIR
Created: Oct 8, 2025
Updated: Mar 11, 2026
Last synced: Jun 9, 2026

Access

26

Community

386 downloads

17 likes

0 views

Dataset Info

Author: MTSAIR
Created: Oct 8, 2025
Updated: Mar 11, 2026
Last synced: Jun 9, 2026

MWS Vision Bench: Russian Business OCR Benchmark for Multimodal LLMs

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info