Available on 1 platform
This benchmark comprises 200 instances for evaluating software engineering agents, with a 95% file match rate and an 89% function match rate. The dataset was created by author zvzv1919 and last updated on 2026-04-17. It appears to be part of a collection for testing agents on tasks like code location and function matching.
The license is unknown, which may restrict usage.