DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

OmniAlign-V: 205k Samples for Multimodal LLM Human Preference Alignment | DataSalon

Home Multimodal & LLMOmniAlign-V: 205k Samples for Multimodal LLM Human Preference Alignment

Multimodal & LLM

OmniAlign-V: 205k Samples for Multimodal LLM Human Preference Alignment

Name: OmniAlign-V: 205k Samples for Multimodal LLM Human Preference Alignment
Creator: PhoenixZ
Published: 2025-02-19T04:58:45
Keywords: Vision Language, Human Alignment, Multimodal Llm, Ai Training, Multimodal

by PhoenixZ·Updated 1y ago

Available on 1 platform

Description

205k high-quality samples for aligning Multimodal Large Language Models with human preferences. The dataset was created by PhoenixZ and is associated with the paper 'OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference'. It was last updated on March 1, 2025.

Use Cases

Fine-tuning MLLMs for better human preference alignment based on the described high-quality samples.
Training Direct Preference Optimization (DPO) models using the companion DPO dataset.
Benchmarking MLLM performance on alignment tasks using the referenced MM-AlignBench.
Developing new alignment techniques for vision-language models based on the multimodal training data.

Strengths

Contains 205k high-quality samples.
Is the official dataset for a published research paper.
Provides companion resources including a DPO dataset and evaluation benchmarks.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: PhoenixZ
Freshness: Last updated 2025-03-01 09:21:45; freshness should be verified.

Multimodal Vision Language Human Alignment Multimodal Llm Ai Training

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

219 downloads

8 likes

0 views

Dataset Info

Author: PhoenixZ
Created: Feb 19, 2025
Updated: Mar 1, 2025
Last synced: May 28, 2026

Access

Community

219 downloads

8 likes

0 views

Dataset Info

Author: PhoenixZ
Created: Feb 19, 2025
Updated: Mar 1, 2025
Last synced: May 28, 2026

OmniAlign-V: 205k Samples for Multimodal LLM Human Preference Alignment

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info