DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

DataClaw: Benchmark Tasks for End-to-End Data Analysis Agents | DataSalon

Home Government & LegalDataClaw: Benchmark Tasks for End-to-End Data Analysis Agents

Government & Legal

DataClaw: Benchmark Tasks for End-to-End Data Analysis Agents

Name: DataClaw: Benchmark Tasks for End-to-End Data Analysis Agents
Creator: GTML-LAB
Published: 2026-04-29T03:50:58
Keywords: End To End Agents, Heterogeneous Data, Benchmark, Data Analysis Benchmark, Ai Agent Evaluation, Multimodal

by GTML-LAB·Updated 1mo ago

Available on 1 platform

Description

A benchmark dataset for evaluating OpenClaw-style end-to-end agents on data analysis tasks. Every task is grounded in real-world data and has a single objective gold answer. The dataset was created by GTML-LAB and was last updated on April 30, 2026.

Use Cases

Benchmarking agent performance on tasks requiring evidence location across heterogeneous files.
Evaluating agent capabilities in filtering and processing real-world data.
Training end-to-end agents to handle multi-step data analysis with a single objective answer.

Strengths

Tasks are grounded in real-world data, providing practical evaluation scenarios.
Each task has a single objective gold answer, enabling clear performance measurement.
The dataset was last updated on April 30, 2026.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: GTML-LAB
Freshness: Last updated 2026-04-30 07:14:23; freshness should be verified.

License is unknown; terms of use must be verified before application.

Multimodal End To End Agents Heterogeneous Data Benchmark Data Analysis Benchmark Ai Agent Evaluation

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

41 downloads

1 likes

0 views

Dataset Info

Author: GTML-LAB
Created: Apr 29, 2026
Updated: Apr 30, 2026
Last synced: May 9, 2026

Access

Community

41 downloads

1 likes

0 views

Dataset Info

Author: GTML-LAB
Created: Apr 29, 2026
Updated: Apr 30, 2026
Last synced: May 9, 2026

DataClaw: Benchmark Tasks for End-to-End Data Analysis Agents

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info