Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A webdataset likely containing 3 million examples for training multimodal AI models, as indicated by the title. It was published by author mvp-lab on the Hugging Face platform and last updated on September 20, 2025. The dataset appears to be associated with the LLaVA (Large Language and Vision Assistant) project, suggesting it contains paired image-text data.
License is unknown; users must verify permissions before commercial use.