DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

LLaVA-Instruct-150K: Large-scale Language-and-Vision Assistant Instruction Dataset | DataSalon

Home Multimodal & LLMLLaVA-Instruct-150K: Large-scale Language-and-Vision Assistant Instruction Dataset

Multimodal & LLM

LLaVA-Instruct-150K: Large-scale Language-and-Vision Assistant Instruction Dataset

Name: LLaVA-Instruct-150K: Large-scale Language-and-Vision Assistant Instruction Dataset
Creator: liuhaotian
Published: 2023-04-17T23:47:27
Keywords: Task Categoriesquestion Answering, Languageen, Task Categoriesvisual Question Answering, Size Categories100 Kn1 M, Licensecc By 40, Regionus

by liuhaotian·Updated 2y ago

Available on 1 platform

Description

150,000 GPT-generated multimodal instruction-following data points collected in April 2023. The dataset utilizes the GPT-4-0314 API to synthesize vision-language interactions for the development of large multimodal models.

Use Cases

Fine-tune vision-language models using the instruction-following pairs to improve multimodal task performance
Train large multimodal models (LMMs) to interpret images based on the GPT-generated natural language instructions
Benchmark open-source vision models against synthetic data generated by the GPT-4-0314 API

Strengths

150,000 multimodal instruction-following data points
Generated using the GPT-4-0314 API in April 2023
Formatted for visual instruction tuning of large multimodal models

Task Categoriesquestion Answering Languageen Task Categoriesvisual Question Answering Size Categories100 Kn1 M Licensecc By 40 Regionus

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

5.6K downloads

583 likes

0 views

Dataset Info

Author: liuhaotian
Created: Apr 17, 2023
Updated: Jan 3, 2024
Last synced: Jul 26, 2026

Access

Community

5.6K downloads

583 likes

0 views

Dataset Info

Author: liuhaotian
Created: Apr 17, 2023
Updated: Jan 3, 2024
Last synced: Jul 26, 2026

LLaVA-Instruct-150K: Large-scale Language-and-Vision Assistant Instruction Dataset

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info