Full HH RLHF: Anthropic's Human Preference Data Reformatted for Training | DataSalon

Multimodal & LLM

Name: Full HH RLHF: Anthropic's Human Preference Data Reformatted for Training
Creator: Dahoas
Published: 2022-12-16T20:45:27
Keywords: RLHF, Text Generation, Preference Data, Text, Human Feedback

Description

A reformatting by Dahoas of Anthropic's HH (helpful and harmless) dataset into prompt, chosen, and rejected samples. The Hugging Face copy was last updated in February 2023. The three-part layout gives a structured format for training and evaluating language models on human preferences.

Use Cases

Training reward models based on human preference data.
Fine-tuning language models via reinforcement learning using the chosen and rejected samples.
Benchmarking model alignment techniques against a known human feedback dataset.
Studying the characteristics of helpful and harmless assistant responses.
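
The reward-model use case above is commonly framed as a pairwise objective: the model should score the chosen response higher than the rejected one. The following is a minimal sketch of that loss in plain Python, not a method prescribed by this dataset. It assumes some reward model has already produced scalar scores; the function names `pairwise_loss` and `mean_pairwise_loss` are hypothetical and belong to no library.

```python
import math


def pairwise_loss(chosen_score: float, rejected_score: float) -> float:
    """Negative log-likelihood that the chosen response is preferred.

    Computes -log(sigmoid(chosen_score - rejected_score)) in a
    numerically stable form (softplus of the negated margin).
    """
    margin = chosen_score - rejected_score
    # softplus(-margin) = log(1 + exp(-margin)), rearranged to avoid overflow.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


def mean_pairwise_loss(score_pairs) -> float:
    """Average the loss over (chosen_score, rejected_score) pairs."""
    pairs = list(score_pairs)
    if not pairs:
        raise ValueError("need at least one score pair")
    return sum(pairwise_loss(c, r) for c, r in pairs) / len(pairs)
```

The loss shrinks toward zero as the chosen score pulls ahead of the rejected score, and grows roughly linearly when the ordering is reversed. In practice the scores would come from a trainable model and the loss would be minimized with an autodiff framework; this sketch only shows the arithmetic.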

Strengths

Derived from Anthropic's established HH dataset, a known source for human preference data.
Reformatted specifically for RLHF training workflows with prompt, chosen, and rejected samples.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Last updated 2023-02-23 17:29:46; freshness should be verified.
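
Because the limitations above call for manual inspection after download, a few automated checks can narrow down which records to look at first. The sketch below assumes each record is a dict with string fields named `prompt`, `chosen`, and `rejected`; those column names are inferred from the description and should be confirmed against the downloaded files. The helpers `check_record` and `summarize` are hypothetical.

```python
from collections import Counter

# Assumed column names; verify them against the downloaded data.
REQUIRED_FIELDS = ("prompt", "chosen", "rejected")


def check_record(record: dict) -> list:
    """Return a list of problems found in one preference record."""
    problems = []
    for field in REQUIRED_FIELDS:
        value = record.get(field)
        if not isinstance(value, str):
            problems.append(f"{field}: missing or not a string")
        elif not value.strip():
            problems.append(f"{field}: empty")
    # Only compare the two responses when all fields are usable strings.
    if not problems and record["chosen"].strip() == record["rejected"].strip():
        problems.append("chosen and rejected are identical")
    return problems


def summarize(records) -> dict:
    """Count records, flagged records, and each kind of problem."""
    total = 0
    flagged = 0
    kinds = Counter()
    for record in records:
        total += 1
        problems = check_record(record)
        if problems:
            flagged += 1
            kinds.update(problems)
    return {"total": total, "flagged": flagged, "problems": dict(kinds)}
```

Calling `summarize` on an iterable of records gives a quick count of how many need a closer look. These checks catch structural defects only; judging whether the chosen response is actually better than the rejected one still requires reading samples.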

Provenance

Source: Anthropic's HH dataset, reformatted by Dahoas.
Collection Method: Reformatted from the original dataset structure.
Freshness: 2023-02-23 17:29:46

License is unknown; users should verify terms before use.

Quality Score

D27

Community

374 downloads

88 likes

0 views

Dataset Info

Author: Dahoas
Created: Dec 16, 2022
Updated: Feb 23, 2023
Last synced: May 6, 2026
