Available on 1 platform
AgentCollabBench is a diagnostic benchmark dataset for multi-agent LLM systems, created by AgentCollabBench and last updated on 2026-05-06. It targets process-level failures that single-agent benchmarks cannot expose, such as those that emerge from inter-agent communication.
The license is unknown; verify any usage restrictions before use.