Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Image-text pairs from the MS COCO 2017 dataset, sourced from cocodataset.org. The data is provided in two formats: a dense format with several sentences per image row and a long format with one caption per row, expanding the dataset length by a factor of five.
Users must distinguish between the 'coco2017' (dense format) and 'coco2017-long' (long format) variants on the Hugging Face page, as they have different row structures and lengths.