Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
AgentProcessBench provides 1,000 full agent trajectories for process-level evaluation of tool-using agents. The benchmark was created by LulaCola and published in March 2026. Each trajectory includes multi-turn messages, tool definitions, tool-use traces, reference outputs, and step-wise process labels.
License information is not specified in the provided metadata; verify before use. The specific file formats and data schema are unknown.