DataSalon

Wild Feedback Dataset With 186k Ordinal Satisfaction Scores | DataSalon

Reinforcement Learning

Name: Wild Feedback Dataset With 186k Ordinal Satisfaction Scores
Creator: THU-KEG
Published: 2026-02-26T02:15:53
Keywords: English, Conversational AI, Text, Large Language Models, Human Feedback

Available on 1 platform

Description

WildFB contains 186,000 instances of human-LLM conversational interactions, each labeled with a 4-level ordinal satisfaction score. The dataset was created by THU-KEG to train reward models using implicit feedback signals extracted from user follow-up queries.

Use Cases

Train a reward model to predict the 4-level ordinal satisfaction score from conversational text.
Analyze patterns in user follow-up queries to identify implicit feedback signals for LLM alignment.
Fine-tune a language model using the satisfaction score labels as a supervision signal for conversational quality.
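
As a minimal sketch of how the 4-level score could serve as a training target, the snippet below encodes each label as cumulative binary thresholds, decodes threshold probabilities back to a level, and maps a level to a scalar reward in [0, 1]. The 1..4 label coding and all function names are assumptions made for illustration; this page does not document the dataset's columns or label values, and this is not presented as the method THU-KEG used.

```python
from typing import List, Sequence

# The listing states a 4-level ordinal score; coding the levels as 1..4 is an assumption.
NUM_LEVELS = 4


def to_cumulative_targets(score: int, num_levels: int = NUM_LEVELS) -> List[int]:
    """Encode an ordinal score as num_levels - 1 binary targets.

    Target k (for k = 1 .. num_levels - 1) is 1 when score > k, else 0.
    """
    if not 1 <= score <= num_levels:
        raise ValueError(f"score must be in 1..{num_levels}, got {score}")
    return [1 if score > k else 0 for k in range(1, num_levels)]


def from_threshold_probs(probs: Sequence[float], num_levels: int = NUM_LEVELS) -> int:
    """Decode predicted P(score > k) values into a level by counting those above 0.5."""
    if len(probs) != num_levels - 1:
        raise ValueError(f"expected {num_levels - 1} probabilities, got {len(probs)}")
    return 1 + sum(1 for p in probs if p > 0.5)


def to_scalar_reward(score: int, num_levels: int = NUM_LEVELS) -> float:
    """Map an ordinal level linearly onto [0, 1] for use as a scalar reward."""
    if not 1 <= score <= num_levels:
        raise ValueError(f"score must be in 1..{num_levels}, got {score}")
    return (score - 1) / (num_levels - 1)
```

The cumulative encoding preserves the ordering of the levels: predicting level 1 for a true level 4 misses three thresholds, while predicting level 3 misses only one. Plain 4-way classification would treat both errors the same.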

Strengths

Contains 186,000 instances filtered and refined from the larger WildChat-4.8M dataset.
Provides a 4-level ordinal satisfaction score for each conversational instance.
Derives implicit reward signals from real-world human-LLM interactions.

Limitations

Column names and data structure are not documented on this page.
The dataset's geographic and temporal coverage is not specified.
The original source, WildChat-4.8M, may introduce biases from its in-the-wild collection.

Provenance

Source: THU-KEG
Collection Method: Filtered and refined from the WildChat-4.8M dataset, extracting implicit reward signals from user follow-up queries.
Time Range: Not specified
Freshness: Last updated February 2026.
Geography: Not specified

The full dataset details, including columns, sample data, file formats, and license, are described on the external Hugging Face page.

Quality Score

D37

Community

42 downloads

3 likes

0 views

Dataset Info

Author: THU-KEG
Created: Feb 26, 2026
Updated: Feb 26, 2026
