Sign in to view source links and access this dataset
Description
BLIP3o-60k is a dataset distilled from GPT-4o for instruction tuning of text-to-image models. It includes categories such as JourneyDB, human-centric data from MSCOCO, Dalle3 outputs, Geneval, common objects, and simple text. The dataset was created by BLIP3o and last updated on May 25, 2025.
Use Cases
Fine-tuning text-to-image models based on GPT-4o distilled instructions.
Training models on human-centric image-caption pairs based on MSCOCO data.
Evaluating model performance on common object recognition tasks based on the 'common objects' category.
Benchmarking instruction-following capabilities based on the 'simple text' category.
Strengths
Dataset is distilled from GPT-4o, a state-of-the-art multimodal model.
Includes multiple distinct categories such as JourneyDB, MSCOCO, and Dalle3 outputs.
Last updated on May 25, 2025, indicating recent maintenance.
Limitations
Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license information are unknown, which may limit suitability assessment.
Provenance
Source
Distilled from GPT-4o.
Collection Method
Instruction tuning dataset creation.
Freshness
2025-05-25 18:15:37
License is unknown; users should verify licensing terms before use.