Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A dataset of graded summaries from the Proofbench evaluation for the Qwen3 4B model, published by author violetxi on Hugging Face. The dataset appears to contain text outputs from a fine-tuned language model assessed for reasoning or safety. The platform tags indicate the data is in JSON format and primarily textual.
License is unknown; usage rights must be verified.