WildFB contains 186,000 instances of human-LLM conversational interactions, each labeled with a 4-level ordinal satisfaction score. The dataset was created by THU-KEG to train reward models using implicit feedback signals extracted from user follow-up queries.
The full dataset details, including columns, sample data, file formats, and license, are described on the external Hugging Face page.