Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Filtered WIT is an image-text dataset derived from the Wikipedia Image Text (WIT) dataset, containing 10,000 samples per archived tar file. Each sample includes a .jpg image, a .txt caption, and a .json metadata file. The dataset is provided by LAION and was last updated in January 2022.
Data is stored in tar archives containing 10,000 samples each; users must handle tar extraction. The full description and specifics are on the Hugging Face dataset page. License information is unknown.