Art Museums PD 440K is a dataset for training text-to-image and multimodal models. It contains images and captions sourced from public domain or CC0-licensed materials. The dataset includes English captions translated into Japanese using the ElanMT model, which was trained on a licensed corpus. It was created by Mitsua and last updated on February 13, 2025.
This summary does not include column information or sample data. For complete details on sources and structure, review the full dataset description on the Hugging Face page.