Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
56 test instances were used to evaluate a software engineering agent's performance. The dataset, created by user zvzv1919 and last updated on April 23, 2026, reports metrics including 50 file matches (89%) and 49 function matches (87%) from a benchmark collection.
License is unknown; terms of use must be verified before application.