Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
318 code completion tasks obtained from 27 popular GitHub C/C++ repositories covering 15 Common Weakness Enumerations (CWEs). The benchmark, created by ai-sec-lab and last updated in November 2025, is built upon the ARVO dataset and is designed to evaluate large language models and agent frameworks for secure code generation.
License is unknown; terms of use must be verified before application.