Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
25,000 multimodal examples likely containing images paired with text instructions and chain-of-thought reasoning. The dataset was created by author 'tomkld' and last updated on Hugging Face on December 10, 2024. Its columns suggest it contains image and text data for training vision-language models.
License is unknown, which may restrict commercial or research use.