PubMedVision is a large-scale medical visual question answering dataset built from image-text pairs extracted from PubMed. FreedomIntelligence enhanced the data quality using GPT-4V and added annotations for body parts and modality. The dataset was updated in February 2025.
The dataset includes separate JSON files for the original captions and for a Chinese version. The body_part and modality annotations are model-generated.
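Because the body_part and modality annotations are model-generated, it can help to tally them and filter on them before use. The following is a minimal sketch, not the dataset's actual schema: only the field names `body_part` and `modality` come from the description above, while the `id` and `caption` keys, the sample values, and the helper names are hypothetical.

```python
import json
from collections import Counter

# Hypothetical records. Only `body_part` and `modality` are named in the
# dataset description; every other key and all values are made up.
SAMPLE_JSON = """
[
  {"id": "a1", "caption": "Axial CT of the chest.", "body_part": "Chest", "modality": "CT"},
  {"id": "a2", "caption": "Brain MRI, T2-weighted.", "body_part": "Brain", "modality": "MRI"},
  {"id": "a3", "caption": "Chest radiograph.", "body_part": "Chest", "modality": "X-ray"},
  {"id": "a4", "caption": "Unlabelled figure.", "body_part": null, "modality": null}
]
"""


def count_by(records, key):
    """Tally the values of one annotation field, grouping missing ones as 'unknown'."""
    return Counter(r.get(key) or "unknown" for r in records)


def _matches(value, wanted):
    """Case-insensitive comparison; a filter of None accepts any value."""
    if wanted is None:
        return True
    return isinstance(value, str) and value.strip().lower() == wanted.strip().lower()


def filter_records(records, body_part=None, modality=None):
    """Keep records whose annotations match every filter that was given."""
    return [
        r for r in records
        if _matches(r.get("body_part"), body_part)
        and _matches(r.get("modality"), modality)
    ]


records = json.loads(SAMPLE_JSON)
modality_counts = count_by(records, "modality")
chest_records = filter_records(records, body_part="chest")
```

Records with a missing annotation are grouped under "unknown" rather than dropped, so gaps in the generated labels stay visible in the tally.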