Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Japanese Medical VQA 12M provides 12 million multimodal records for medical visual question answering, developed by MIL-UT and released in 2026. It consists of medical images paired with English and Japanese text across five distinct data-construction stages including captions and Q&A pairs.
Released under CC BY-SA 4.0 license. Available in Parquet and Webdataset formats, optimized for use with Polars, Dask, and Hugging Face Datasets libraries.