Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
BanglaVision40K is a combined dataset for Bengali image captioning, merging the Flickr30k dataset with Bangladeshi sources. The dataset likely contains image-text pairs for training and evaluating vision-language models. Its author, organization, and specific size are unknown.
License is unknown; users must verify permissions before use.