DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Psychology RLHF: Data for Training a LLaMA-7B Reward Model | DataSalon

Home Multimodal & LLMPsychology RLHF: Data for Training a LLaMA-7B Reward Model

Multimodal & LLM

Psychology RLHF: Data for Training a LLaMA-7B Reward Model

Name: Psychology RLHF: Data for Training a LLaMA-7B Reward Model
Creator: samhog
Published: 2023-07-17T14:46:35
Keywords: Rlhf, Psychology, Text, Llm Training, Reward Model

by samhog·Updated 3y ago

Available on 1 platform

Description

Psychology RLHF data was used to train a LLaMA-7B reward model. The dataset was uploaded by author 'samhog' to Hugging Face on July 17, 2023. Its specific content, size, and structure are not detailed in the provided metadata.

Use Cases

Training a reward model for RLHF based on psychology-related prompts and responses.
Fine-tuning language models for tasks involving psychological reasoning or dialogue.
Benchmarking RLHF training pipelines on a domain-specific dataset.

Strengths

Dataset has a specific, documented purpose: training a LLaMA-7B reward model.
Author and upload date (2023-07-17) are explicitly provided.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file formats are unknown, which may limit suitability assessment.

Provenance

Source: huggingface
Freshness: Last updated 2023-07-17 14:48:32; freshness should be verified.

License is unknown, which may restrict commercial or research use.

Text Rlhf Psychology Llm Training Reward Model

Related Datasets

Quality Score

D27

Description

Source

Reputation

Quality Score

D27

Description

Source

Reputation

Access

Community

44 downloads

9 likes

0 views

Dataset Info

Author: samhog
Created: Jul 17, 2023
Updated: Jul 17, 2023
Last synced: May 21, 2026

Access

Community

44 downloads

9 likes

0 views

Dataset Info

Author: samhog
Created: Jul 17, 2023
Updated: Jul 17, 2023
Last synced: May 21, 2026

Psychology RLHF: Data for Training a LLaMA-7B Reward Model

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info