Available on 1 platform
Clawbench is an open-source benchmark for evaluating AI agents on everyday tasks performed in a web browser. It was created by TIGER-AI-Lab and last updated on May 25, 2026. Its tasks are presumably designed to test how well an agent can interact with web interfaces.
The benchmark is released under the Apache-2.0 license, which permits commercial use provided the license text and copyright notices are retained.