Available on 1 platform
A Kaggle-hosted dataset built around a version of the BLIP model fine-tuned on the COCO dataset. It likely contains image-text pairs for action captioning tasks. Its size, columns, and creation details are unknown, so verify its content and scale after download.
License information is unknown; users must verify licensing terms before use.
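
Because the size and contents are not documented, one way to check them after download is to summarize the extracted files. The following is a minimal sketch, not part of the dataset itself: it assumes the archive has already been downloaded and extracted to a local folder, and the folder name `blip_coco_download` and the helper `summarize_download` are hypothetical.

```python
from collections import Counter
from pathlib import Path


def summarize_download(root):
    """Count files by extension and total their size under root."""
    root = Path(root)
    counts = Counter()
    total_bytes = 0
    for path in root.rglob("*"):
        if path.is_file():
            # Group by lowercased suffix; files without one go under "(none)".
            counts[path.suffix.lower() or "(none)"] += 1
            total_bytes += path.stat().st_size
    return counts, total_bytes


if __name__ == "__main__":
    # Hypothetical folder name; replace with wherever the archive was extracted.
    counts, total_bytes = summarize_download("blip_coco_download")
    for ext, n in counts.most_common():
        print(f"{ext}: {n} files")
    print(f"total size: {total_bytes / 1e6:.1f} MB")
```

The extension counts give a rough indication of whether the download holds images, caption files, model weights, or a mix. Any license file that turns up in the listing should be read before use.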