DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

CodeReview-Bench: 100K+ GitHub Code Editing and Review Pairs | DataSalon

Home Media & CommunicationCodeReview-Bench: 100K+ GitHub Code Editing and Review Pairs

Media & Communication

CodeReview-Bench: 100K+ GitHub Code Editing and Review Pairs

Name: CodeReview-Bench: 100K+ GitHub Code Editing and Review Pairs
Creator: ronantakizawa
Published: 2026-03-05T09:06:52
Keywords: Languagecode, Task Categoriestext Generation, Librarypolars, Languageen, Modalitytext, Size Categories100 Kn1 M, Librarymlcroissant, Software Engineering, Librarydatasets, Benchmark, Librarypandas, Parquet, Code Review, Code Generation, Regionus, Licensemit

by ronantakizawa·Updated 4mo ago

Available on 1 platform

Description

CodeReview-Bench is a software engineering benchmark curated by ronantakizawa for evaluating models on code editing and review tasks. It contains between 100,000 and 1,000,000 records derived from GitHub interactions, updated as of March 2026. The dataset is structured to support sequence-to-sequence tasks where natural language feedback is converted into code modifications.

Use Cases

Generating 'after_code' from 'before_code' and 'reviewer_comment' inputs.
Analyzing model performance across different programming 'language' categories.
Evaluating the utility of 'diff_context' in predicting precise code modifications.

Strengths

100,000 to 1,000,000 records
MIT license
Includes 'diff_context' for localized code changes

Limitations

Potential for label noise due to the organic nature of GitHub comments
Geographic bias toward US-based data as indicated by dataset tags
Lack of manual verification for the correctness of the 'after_code' targets

Provenance

Source: ronantakizawa/github-codereview
Collection Method: curated from GitHub
Freshness: Updated March 2026.
Geography: US

Requires the Hugging Face datasets library for loading; the dataset is partitioned into 'code-editing' and potentially other task-specific subsets.

Parquet Languagecode Task Categoriestext Generation Librarypolars Languageen Modalitytext Size Categories100 Kn1 M Librarymlcroissant Software Engineering Librarydatasets Benchmark Librarypandas Code Review Code Generation Regionus Licensemit

Related Datasets

Quality Score

D36

Description

Source

Reputation

Quality Score

D36

Description

Source

Reputation

Access

Community

84 downloads

3 likes

0 views

Dataset Info

Author: ronantakizawa
Created: Mar 5, 2026
Updated: Mar 5, 2026

Access

Community

84 downloads

3 likes

0 views

Dataset Info

Author: ronantakizawa
Created: Mar 5, 2026
Updated: Mar 5, 2026

CodeReview-Bench: 100K+ GitHub Code Editing and Review Pairs

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info