Description
31,511 public-domain artworks focused on animals and the natural world, including 15,920 paintings and illustrations and 15,407 photographed objects. Each work is paired with a structured caption generated by a vision-language model (VLM) and with metadata on medium, attribution, and inscriptions. The dataset was created by jaddai and last updated on Hugging Face in May 2026.
Use Cases
Train image-captioning models on the structured VLM captions, which describe diverse artistic depictions of fauna.
Analyze the distribution of artistic mediums (e.g., carved, cast, glazed) used to represent animals across cultures and time periods.
Fine-tune classifiers to identify animal species or artistic styles within a large, labeled collection of public-domain art.
Study metadata patterns, such as artist attribution and inscriptions, within a thematic art collection.
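The medium-distribution use case above can be sketched as follows. Because column-level documentation is absent, the field name `medium` and the sample rows are hypothetical placeholders, not the dataset's actual schema; the keyword list reuses the examples named in that use case.

```python
from collections import Counter

# Keywords taken from the examples in the use case; extend as needed.
MEDIUM_KEYWORDS = ("carved", "cast", "glazed")


def count_mediums(records, field="medium", keywords=MEDIUM_KEYWORDS):
    """Count how many records mention each keyword in the given text field.

    `field` is an assumed column name; replace it after inspecting the real data.
    """
    counts = Counter()
    for record in records:
        # Missing or null values are treated as empty text.
        text = str(record.get(field) or "").lower()
        for keyword in keywords:
            if keyword in text:
                counts[keyword] += 1
    return counts


# Hypothetical rows, for illustration only.
sample = [
    {"medium": "Carved jade"},
    {"medium": "Cast bronze"},
    {"medium": "Glazed earthenware, carved decoration"},
    {"medium": None},
]
result = count_mediums(sample)
```

Plain substring matching is a deliberate simplification: it is easy to audit but can over-match (for example, "cast" inside a longer word), so a real analysis would probably want tokenization or a curated vocabulary.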
Strengths
Contains 31,511 individual artworks, providing substantial scale for model training.
Includes a structured VLM caption for each work, which likely provides detailed, machine-readable descriptions.
Covers multiple artistic mediums, with 15,920 paintings/illustrations and 15,407 photographed objects.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
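Exact row count is unconfirmed: the two stated categories sum to 31,327, leaving 184 of the 31,511 works unaccounted for, which may limit suitability assessment.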
Data may reflect temporal or source bias inherent to the original OpenArt collection.
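Since field semantics must be inferred after download, a first pass might simply profile the fields that appear in a sample of rows. The sketch below assumes the rows are available as plain dictionaries; the field names `caption` and `inscriptions` in the sample are hypothetical and stand in for whatever columns the dataset actually exposes.

```python
from collections import defaultdict


def summarize_fields(records):
    """Return, per field, the value types seen and how many values were non-empty."""
    summary = defaultdict(lambda: {"types": set(), "non_empty": 0})
    for record in records:
        for name, value in record.items():
            entry = summary[name]
            entry["types"].add(type(value).__name__)
            # None, empty strings, and empty containers count as empty.
            if value not in (None, "", [], {}):
                entry["non_empty"] += 1
    return dict(summary)


# Hypothetical rows, for illustration only.
rows = [
    {"caption": "A heron standing in reeds", "inscriptions": ""},
    {"caption": "Bronze figure of a horse", "inscriptions": "signed lower left"},
]
profile = summarize_fields(rows)
```

Fields that are rarely populated, or that mix several value types, are good candidates for closer manual inspection before any training or analysis.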
Provenance
Source
Part of the OpenArt family of open, public-domain art datasets.
Collection Method
Likely aggregated and processed from multiple public-domain art sources, with added structured captions.
Freshness
Last updated 2026-05-28 01:42:21; freshness should be verified.
License
License is unknown; users must verify terms of use before downloading.