Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Colonclip is a multimodal training dataset for medical computer vision models, likely containing colonoscopy images paired with text. The dataset was uploaded by author ZoeTAN to Hugging Face and was last updated on June 3, 2026. It includes LMDB archives for training, label embeddings, and a separate testing set.
The description notes that some files (train_texts.json, train_imgs.tsv) have an unclear encoding format and advises checking the code. Two CSV files are described as summaries for information only and are not used in the provided code.