Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
MINT-1T is an open-source multimodal dataset containing 1 trillion text tokens and 3.4 billion interleaved images, representing a tenfold scale-up from prior open-source collections. It was created by a team from the University of Washington to support research in multimodal pretraining, incorporating sources like PDFs and ArXiv papers.
The dataset page on Hugging Face must be consulted for the full description, access details, and any specific licensing terms, which are currently unknown.