Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
ImageCoDe is a vision-and-language benchmark requiring contextual understanding of pragmatics, temporality, long descriptions, and visual nuances. The dataset was created by BennoKrojer and last updated on May 13, 2022. The specific row count, column count, and dataset size are unknown.
To use the dataset fully, users must download a separate 'image_sets.zip' file containing the images, as the Hugging Face loader may only provide descriptions. The specific license is tagged but not explicitly stated (e.g., 'afl 30').