PRDbench is a benchmark dataset of 50 test cases for evaluating the development capabilities of code agents in real-world environments. Each test case pairs a product requirements document (the PRD query) with an acceptance scoring scheme (the Criteria). The dataset is authored by AGI-Eval and was last updated in December 2025.
The full description is hosted externally; users must visit the dataset page for complete details. License information is unknown.
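As an illustration of the test-case structure described above (a PRD query plus acceptance criteria), the following is a minimal Python sketch. The class names, field names, per-criterion maximum scores, and the normalization rule are all assumptions made for this example; the actual PRDbench schema and scoring method are documented on the dataset page and may differ.

```python
from dataclasses import dataclass, field


@dataclass
class Criterion:
    # Hypothetical: one acceptance check and the most points it can earn.
    description: str
    max_score: float


@dataclass
class PRDTestCase:
    # Hypothetical field names; the real dataset schema may differ.
    case_id: str
    prd_query: str
    criteria: list[Criterion] = field(default_factory=list)

    def max_total(self) -> float:
        """Sum of the maximum scores over all criteria."""
        return sum(c.max_score for c in self.criteria)


def normalized_score(case: PRDTestCase, awarded: list[float]) -> float:
    """Return awarded points as a fraction of the maximum.

    Assumed rule: each awarded value is clipped to [0, max_score] for its
    criterion, and a case with no scorable criteria yields 0.0.
    """
    if len(awarded) != len(case.criteria):
        raise ValueError("one awarded score is required per criterion")
    total = case.max_total()
    if total == 0:
        return 0.0
    clipped = sum(
        min(max(points, 0.0), criterion.max_score)
        for points, criterion in zip(awarded, case.criteria)
    )
    return clipped / total


# Example with made-up content (not taken from the dataset).
example = PRDTestCase(
    case_id="demo-001",
    prd_query="Build a command-line to-do list manager.",
    criteria=[
        Criterion("Tasks can be added and listed", 2.0),
        Criterion("Tasks persist between runs", 3.0),
    ],
)
example_score = normalized_score(example, [2.0, 1.5])
```

Keeping the criteria as a list attached to each case mirrors the pairing described in the dataset summary, so a single record carries everything needed to grade one agent submission.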