DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

HH_length_biased_15k: A Length-Biased RLHF Subset | DataSalon

Home Multimodal & LLMHH_length_biased_15k: A Length-Biased RLHF Subset

Multimodal & LLM

HH_length_biased_15k: A Length-Biased RLHF Subset

Name: HH_length_biased_15k: A Length-Biased RLHF Subset
Creator: Taywon
Published: 2024-12-05T00:27:48
Keywords: Rlhf, Text, Language Model, Ai Training, Text, Length Bias, Human Feedback

by Taywon·Updated 1y ago

Available on 1 platform

Description

HH_length_biased_15k is a 15,000-sample subset of Anthropic/hh-rlhf, created for the paper 'Understanding impacts of human feedback via influence functions'. Taywon Min authored this dataset, which was last updated on December 5, 2024. It contains 976 samples where responses were intentionally flipped to be lengthy.

Use Cases

Analyzing the impact of response length bias on RLHF training outcomes based on the described flipped samples.
Studying influence functions in human feedback datasets based on the paper's stated research focus.
Benchmarking language model training with artificially induced length preferences based on the dataset's construction.

Strengths

Dataset size is explicitly stated as 15,000 samples.
Contains 976 specifically manipulated samples to induce length bias.
Derived from the established Anthropic/hh-rlhf dataset.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Last updated 2024-12-05 01:05:11; freshness should be verified.

Provenance

Source: Anthropic/hh-rlhf
Collection Method: A random subset with 976 samples flipped towards lengthy responses.
Freshness: 2024-12-05 01:05:11

License is unknown.

Text Rlhf Language Model Ai Training Length Bias Human Feedback

Related Datasets

Quality Score

D39

Description

Source

Reputation

Quality Score

D39

Description

Source

Reputation

Access

Community

15 downloads

1 likes

0 views

Dataset Info

Author: Taywon
Created: Dec 5, 2024
Updated: Dec 5, 2024
Last synced: May 11, 2026

Access

Community

15 downloads

1 likes

0 views

Dataset Info

Author: Taywon
Created: Dec 5, 2024
Updated: Dec 5, 2024
Last synced: May 11, 2026

HH_length_biased_15k: A Length-Biased RLHF Subset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info