Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Majestrino Unified Detailed Captions is a filtered subset of the laion/majestrino-data collection, containing all samples with a unified_detailed_caption field. The dataset comprises 4,658,407 samples, packaged in approximately 932 tar files totaling around 1,017 GB. It was created by TTS-AGI and last updated on March 29, 2026.
Data is distributed in many large tar files (~1.1 GB each); storage and extraction planning is required.