MM-Corel-10K: Multimodal Image-Text Dataset for CBIR
Available on 1 platform
Description
A multimodal dataset of images with associated text, apparently intended for Content-Based Image Retrieval (CBIR) research. It is hosted on Kaggle, but the available metadata does not state its size, author, or last update date. The dataset's content and structure require verification after download.
Use Cases
Train a model for cross-modal retrieval between images and text captions (inferred from domain, verify after download)
Benchmark content-based image retrieval (CBIR) algorithms (inferred from domain, verify after download)
Fine-tune vision-language models on paired image-text data (inferred from domain, verify after download)
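For the CBIR benchmarking use case, one common metric is precision at k: the fraction of the top k retrieved images that share the query's category. The sketch below is a minimal illustration only. It assumes each image carries a category label, which is not confirmed for this dataset, and the function name and label strings are hypothetical.

```python
def precision_at_k(ranked_labels, query_label, k):
    """Fraction of the top-k retrieved items whose label matches the query.

    ranked_labels: labels of retrieved items, best match first (hypothetical input).
    query_label:   label of the query image (assumes labels exist in the dataset).
    k:             cutoff rank; must be a positive integer.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    top_k = ranked_labels[:k]
    hits = sum(1 for label in top_k if label == query_label)
    # Divide by k even if fewer than k items were retrieved,
    # so missing results count against the score.
    return hits / k
```

Averaging this value over many queries gives a single score for comparing retrieval methods, provided the label field has been verified after download.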
Strengths
Published on Kaggle, a platform with an established data science community.
The title and description indicate it is a multimodal dataset designed for a specific research area (CBIR).
Limitations
Metadata is minimal; actual content requires verification after download.
Column-level documentation is absent; field semantics must be inferred after download.
Data may reflect biases inherent to the original Corel-10K collection, such as a specific thematic or stylistic focus.
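Because the metadata and column documentation are missing, a first verification step after download could be a simple inventory of the extracted files. The following is a minimal sketch, not a description of the dataset's actual layout: the function name is hypothetical, and it assumes only that the archive was extracted to a local directory and that any tabular annotations are stored as CSV files.

```python
import csv
from collections import Counter
from pathlib import Path


def summarize_dataset_dir(root):
    """Count files by extension and collect the header row of each CSV file."""
    root = Path(root)
    ext_counts = Counter()
    csv_headers = {}
    for path in sorted(root.rglob("*")):
        if not path.is_file():
            continue
        ext = path.suffix.lower() or "<none>"
        ext_counts[ext] += 1
        if ext == ".csv":
            # Read only the first row; it is assumed (not known) to be a header.
            with path.open(newline="", encoding="utf-8", errors="replace") as f:
                header = next(csv.reader(f), [])
            csv_headers[path.relative_to(root).as_posix()] = header
    return dict(ext_counts), csv_headers
```

Calling `summarize_dataset_dir` on the extraction directory returns the file counts per extension and the first row of every CSV found, which gives a starting point for inferring field semantics.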
Provenance
Source
Kaggle
The license is unknown; users must verify the terms of use before using the dataset.