Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
OpenRubrics' RubricARROW-Judge-SFT dataset provides instruction-tuning style data for training a judge model in the RubricARROW reinforcement learning framework. The dataset is hosted on Hugging Face and was last updated on May 27, 2026. It is intended for post-training large language models in non-verifiable domains.
License is unknown, which may restrict commercial use or redistribution.