Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
KORE-74K is a multimodal dataset containing over 74,000 training entries for image recognition, captioning, and visual question answering tasks. It was created by author kailinjiang and published in 2026, building upon the MMEVOKE dataset. The data includes separate archives for recognition/caption images and VQA images, paired with structured JSON annotations.
The image data is split into multiple zip parts that must be concatenated using the command `cat split_zip_part_* > combined.zip` before use. The dataset is intended to be combined with MMEVOKE's image data for complete training.