A curated subset of 35,794 image-caption pairs from the Conceptual Captions dataset, re-annotated in Russian for accessibility. The data was grouped into 2,484 semantic clusters and re-annotated using teacher vision-language models. It was created by Pavel Mikheyev and last updated in May 2026.
The dataset is licensed under CC-BY-4.0. Files are packaged in a 1.4 GB ZIP archive containing CSV, JSON, and JPEG files.
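Since the archive mixes three file types, a quick inventory before extracting 1.4 GB can be useful. The following is a minimal sketch using only the Python standard library; the function name `count_by_extension` is hypothetical, and the internal layout of the archive is not documented here, so the sketch makes no assumption about file or folder names.

```python
import zipfile
from collections import Counter
from pathlib import PurePosixPath


def count_by_extension(archive_path):
    """Count archive members by lowercase file extension, skipping directories.

    Hypothetical helper: it reads only the ZIP central directory,
    so nothing is extracted to disk.
    """
    counts = Counter()
    with zipfile.ZipFile(archive_path) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            # ZIP member names always use forward slashes.
            suffix = PurePosixPath(info.filename).suffix.lower()
            counts[suffix or "(none)"] += 1
    return counts
```

Calling `count_by_extension` on the downloaded archive (whatever its local path is) returns a `Counter` keyed by extension, which can be compared against the 35,794 pairs stated above if the images turn out to be stored one per file.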