Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
OpenCodeReasoning-2 contains 1.4 million Python and 1.1 million C++ samples derived from 34,799 unique competitive programming questions. This synthetic dataset is designed for supervised fine-tuning tasks in code completion and critique. It was created by NVIDIA and released on the Hugging Face platform in May 2025.
The full dataset description and details on columns, file formats, and license are hosted externally on the Hugging Face dataset page.