DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Clawbench: Open-Source Benchmark for Browser AI Agents | DataSalon

Home Government & LegalClawbench: Open-Source Benchmark for Browser AI Agents

Government & Legal

Clawbench: Open-Source Benchmark for Browser AI Agents

Name: Clawbench: Open-Source Benchmark for Browser AI Agents
Creator: TIGER-AI-Lab
Published: 2026-04-10T01:59:17
License: Apache-2.0
Keywords: Benchmark, Ai Agents, Tabular, Browser Automation, Open Source

by TIGER-AI-Lab / TIGER-AI-Lab·Updated 28d ago

Available on 1 platform

Description

Clawbench is an open-source benchmark for evaluating AI agents on daily tasks performed in a web browser. It was created by TIGER-AI-Lab and last updated on May 25, 2026. The benchmark likely contains tasks designed to test an agent's ability to interact with web interfaces.

Use Cases

Benchmarking AI agent performance on daily web tasks based on the described evaluation framework.
Training browser-based AI assistants on structured tasks mentioned in the description.
Comparing different AI agent architectures on a standardized set of web interaction challenges.

Strengths

Released under the permissive Apache-2.0 license, allowing for broad use and modification.
Open-source nature facilitates community review and contribution to the benchmark.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment for large-scale training.

Provenance

Source: TIGER-AI-Lab
Freshness: Last updated 2026-05-25 00:33:10; freshness should be verified.

License is Apache-2.0, which permits commercial use with attribution.

Tabular Benchmark Ai Agents Browser Automation Open Source

Related Datasets

Quality Score

D29

Description

Source

Reputation

Quality Score

D29

Description

Source

Reputation

Access

Community

353 likes

0 views

Dataset Info

License: Apache-2.0
Author: TIGER-AI-Lab
Org: TIGER-AI-Lab
Created: Apr 10, 2026
Updated: May 25, 2026
Language: Python
Last synced: Jun 22, 2026

Access

Community

353 likes

0 views

Dataset Info

License: Apache-2.0
Author: TIGER-AI-Lab
Org: TIGER-AI-Lab
Created: Apr 10, 2026
Updated: May 25, 2026
Language: Python
Last synced: Jun 22, 2026

Clawbench: Open-Source Benchmark for Browser AI Agents

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info