Name: ViRL39K: 38,870 Verifiable QAs for Vision-Language RL Training
Creator: TIGER-Lab
Published: 2025-04-21T11:56:11
Keywords: Task Categoriesimage Text To Text, Task Categoriesquestion Answering, Languageen, Modalityimage, Arxiv250408837, Training, Regionus, Reinforcement Learning, Licensemit

Description

ViRL39K contains 38,870 verifiable question-answer pairs designed for Vision-Language Reinforcement Learning training, released by TIGER-Lab in April 2025. It aggregates and refines data from seven specialized sources, including Llava-OneVision, MM-Math, and DeepScaleR, through a process of cleaning, reformatting, and verification.

Use Cases

Training Vision-Language Reasoning Models using the verifiable QA pairs for reward signal generation
Fine-tuning multimodal models on the MM-Math and MV-Math subsets for geometric and mathematical reasoning
Implementing reinforcement learning pipelines using the reformatted chain-of-thought (M3CoT) data

Strengths

38,870 verifiable QA pairs
Consolidates 7 distinct vision-language datasets including MM-Eureka and DeepScaleR
MIT licensed for open research

Limitations

Secondary dataset composed of reformatted existing sources rather than original primary collection
Potential domain bias toward mathematical reasoning due to source selection (MM-Math, MV-Math)

Provenance

Source: TIGER-Lab (Arxiv 2504.08837)
Collection Method: Aggregated from existing datasets (Llava-OneVision, R1-OneVision, MM-Eureka, MM-Math, M3CoT, DeepScaleR, MV-Math) with cleaning and verification.
Freshness: Last updated April 2025.

The dataset serves as the foundation for the VL-Rethinker model; users should refer to Arxiv 2504.08837 for specific details on the verification logic used to ensure QA accuracy for RL training.

Task Categoriesimage Text To Text Task Categoriesquestion Answering Languageen Modalityimage Arxiv250408837 Training Regionus Reinforcement Learning Licensemit

ViRL39K: 38,870 Verifiable QAs for Vision-Language RL Training

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info