164 handwritten Python programming problems were released by OpenAI in 2021 to evaluate the functional correctness of code generation models. Each entry provides a function signature, docstring, reference implementation, and unit tests for automated validation.
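As a rough illustration of how the four parts of an entry fit together, the sketch below assembles a candidate completion and the unit tests into one program. The entry is a hypothetical toy problem, not one of the 164, and the field names (`prompt`, `canonical_solution`, `test`, `entry_point`) and the `check(candidate)` convention are assumptions about the record layout rather than something this page specifies.

```python
# Hypothetical entry written for illustration; it is NOT taken from the dataset.
# Field names are assumed, not confirmed by this page.
entry = {
    "task_id": "Example/0",
    # Function signature plus docstring, as shown to the model.
    "prompt": 'def add(a: int, b: int) -> int:\n    """Return the sum of a and b."""\n',
    # Reference implementation: the function body that completes the prompt.
    "canonical_solution": "    return a + b\n",
    # Unit tests, assumed to be wrapped in a check(candidate) function.
    "test": (
        "def check(candidate):\n"
        "    assert candidate(1, 2) == 3\n"
        "    assert candidate(-1, 1) == 0\n"
    ),
    "entry_point": "add",
}


def build_program(entry: dict, completion: str) -> str:
    """Join prompt, completion, and tests into a single source string.

    This only builds text; it does not execute anything.
    """
    return (
        entry["prompt"]
        + completion
        + "\n"
        + entry["test"]
        + "\n"
        + f"check({entry['entry_point']})\n"
    )


program = build_program(entry, entry["canonical_solution"])
print(program)
```

A model's output would take the place of `entry["canonical_solution"]`; the completion counts as functionally correct if the assembled program runs without raising an assertion error.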
Evaluation runs generated code against the included unit tests, so it requires a secure sandbox environment to guard against arbitrary code execution.
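A minimal sketch of the execution step, assuming each candidate has already been assembled into a single program string. The helper name `run_untrusted` and the 5-second default are hypothetical. Note that a child process with a timeout is not a secure sandbox by itself: it limits runaway loops and keeps crashes out of the evaluator, but the code can still touch the filesystem and network. A real setup would run this inside a container, VM, or similarly restricted environment.

```python
import subprocess
import sys
import tempfile
from pathlib import Path


def run_untrusted(program: str, timeout_s: float = 5.0) -> bool:
    """Run `program` in a separate Python process.

    Returns True only if the process exits with code 0 before the timeout.
    NOTE: this gives process isolation and a time limit only. It is not a
    security boundary and should itself run inside a real sandbox.
    """
    with tempfile.TemporaryDirectory() as workdir:
        script = Path(workdir) / "candidate.py"
        script.write_text(program, encoding="utf-8")
        try:
            result = subprocess.run(
                # -I: isolated mode, ignores user site-packages and PYTHON* env vars.
                [sys.executable, "-I", str(script)],
                cwd=workdir,
                capture_output=True,
                timeout=timeout_s,
            )
        except subprocess.TimeoutExpired:
            # The child is killed by subprocess.run when the timeout expires.
            return False
        return result.returncode == 0


if __name__ == "__main__":
    print(run_untrusted("assert 1 + 1 == 2\n"))  # passing tests
    print(run_untrusted("assert 1 + 1 == 3\n"))  # failing assertion
```

Treating a timeout as a failure keeps a non-terminating completion from stalling the whole evaluation run.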