Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
CodeReview-Bench is a software engineering benchmark curated by ronantakizawa for evaluating models on code editing and review tasks. It contains between 100,000 and 1,000,000 records derived from GitHub interactions, updated as of March 2026. The dataset is structured to support sequence-to-sequence tasks where natural language feedback is converted into code modifications.
Requires the Hugging Face datasets library for loading; the dataset is partitioned into 'code-editing' and potentially other task-specific subsets.