The Wikipedia-based Image Text (WIT) Dataset contains 37.6 million entity-rich image-text examples paired with 11.5 million unique images across 108 Wikipedia languages. It was created by keshan for pretraining multimodal machine learning models and was last updated in August 2021.
Column definitions, sample data, file formats, and license information are not provided.