Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Innovator-VL-RL-172K is a curated multimodal reinforcement learning dataset containing 172,000 instances released by InnovatorLab in 2026. It provides image-text reasoning pairs designed to support RLHF-style optimization for vision-language models.
The dataset is provided in Parquet format and is compatible with Polars, Dask, and Hugging Face Datasets libraries.