Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
300 test Issue-Pull Request pairs from 11 popular Python repositories, released as part of the SWE-bench research project. The dataset was created to test systems' ability to automatically resolve real-world GitHub issues, with evaluation performed by verifying unit tests using post-PR behavior as the reference solution. It was released by the SWE-bench organization and last updated on 2025-04-29.
The dataset only contains a subset; the full description is on the Hugging Face dataset page.