DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

LLaVA-NeXT: 100K-1M Multimodal Instruction Tuning Pairs | DataSalon

Home Multimodal & LLMLLaVA-NeXT: 100K-1M Multimodal Instruction Tuning Pairs

Multimodal & LLM

LLaVA-NeXT: 100K-1M Multimodal Instruction Tuning Pairs

Name: LLaVA-NeXT: 100K-1M Multimodal Instruction Tuning Pairs
Creator: lmms-lab
Published: 2024-08-08T12:03:51
Keywords: Librarypolars, Librarydask, Modalitytext, Size Categories100 Kn1 M, Librarymlcroissant, Modalityimage, Librarydatasets, Parquet, Regionus

by lmms-lab·Updated 1y ago

Available on 1 platform

Description

LLaVA-NeXT Data contains between 100,000 and 1,000,000 instruction-tuning pairs for multimodal large language models, released by lmms-lab in August 2024. It provides the specific data mixtures used to train the LLaVA-NeXT and LLaVA-NeXT (stronger) models, featuring synchronized image and text instruction sets.

Use Cases

Fine-tuning multimodal models using the provided image-text instruction pairs
Benchmarking vision-language models against the LLaVA-NeXT training distribution
Researching instruction-following capabilities in large vision-language models

Strengths

Contains between 100,000 and 1,000,000 records
Includes raw JSON and image formats for direct compatibility with LLaVA training pipelines
Used to train state-of-the-art open-source multimodal models

Limitations

Specific column names and schema are not detailed in the provided metadata
Large storage requirements due to high-volume image data

Provenance

Source: lmms-lab
Collection Method: Curated instruction tuning data mixture
Freshness: Last updated August 30, 2024, reflecting the most recent LLaVA-NeXT training iterations.

The dataset is available in Parquet format on Hugging Face, but also includes a de-compressed raw format with JSON files and structured image folders for users familiar with the LLaVA data format.

Parquet Librarypolars Librarydask Modalitytext Size Categories100 Kn1 M Librarymlcroissant Modalityimage Librarydatasets Regionus

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

3.9K downloads

46 likes

0 views

Dataset Info

Author: lmms-lab
Created: Aug 8, 2024
Updated: Aug 30, 2024
Last synced: Apr 29, 2026

Access

Community

3.9K downloads

46 likes

0 views

Dataset Info

Author: lmms-lab
Created: Aug 8, 2024
Updated: Aug 30, 2024
Last synced: Apr 29, 2026

LLaVA-NeXT: 100K-1M Multimodal Instruction Tuning Pairs

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info