Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
ALLaVA-4V is a multimodal dataset created by FreedomIntelligence using GPT-4V to generate detailed captions and complex reasoning question-answer pairs for images. The dataset incorporates data from sources like LAION and WizardLM, with its generation pipeline and prompts documented on the project page. It was last updated on June 8, 2025.
Complete dataset details, including specific columns, sample data, size, and license, are only available on the Hugging Face dataset page. Users must review the source page for full documentation.