Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MINT-1T is an open-source multimodal interleaved dataset containing 1 trillion text tokens and 3.4 billion images, representing a 10x scale-up from previous open-source collections. It was created by a team from the University of Washington and includes sources such as PDFs and ArXiv papers to facilitate multimodal pretraining research. The dataset was last updated on the platform in September 2024.
License is listed as CC BY 4.0 in platform tags, but should be verified on the official dataset page.