Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
LAION-High-Quality-Pro-6M is a 6-million-sample image-text dataset used to train Vision-Language-Vision auto-encoder models. The dataset, hosted by author ccvl on Hugging Face, was last updated on September 20, 2025. It was created for scalable knowledge distillation from diffusion models.
The example code suggests data may be stored in base64 or raw bytes format, requiring specific decoding.