A collection of filtered image-text pairs from academic resources, used for pre-training the Open-Qwen2VL multimodal large language model. The dataset includes subsets such as ccs_webdataset, which is derived from CC3M-CC12M-SBU and filtered by CLIP, and datacomp_medium_dfn_webdataset. It was created by weizhiwang and last updated on April 16, 2025.
The license is unknown; users must verify permissions before use. The full description is hosted on a separate Hugging Face dataset page.