Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
56 total instances of agent performance on a software engineering benchmark. The dataset, created by user zvzv1919, was last updated on 2026-04-17. It likely contains metrics from evaluating an agent's ability to locate and match code functions and files.
License is unknown; terms of use must be verified before application.