DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

OnchainPlanBench Seed: Web3 AI-Agent Action Plan Evaluations | DataSalon

Government & Legal

Name: OnchainPlanBench Seed: Web3 AI-Agent Action Plan Evaluations
Creator: CodePit
Published: 2026-06-02T12:15:27
Keywords: Benchmark, AI Agents, Text, Action Planning, Web3, Safety Evaluation

Available on 1 platform

Description

CodePit released OnchainPlanBench Seed, an early dataset for evaluating small open-weight models. The dataset tests a model's ability to critique, repair, reject, or approve Web3 AI-agent action plans before wallet execution. It was last updated on June 2, 2026.

Use Cases

Benchmarking model performance on plan critique tasks based on provided user intent and wallet context.
Training models to approve or reject action plans based on provided risk and privacy policies.
Developing tools for plan repair based on the available tools and policy constraints described in each row.
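The benchmarking use case above could be scored along the following lines. This is a minimal sketch, not CodePit's evaluation method: the label strings are taken from the four decisions named in the description, while the idea that each row carries a single expected decision, and the example label lists, are assumptions, since the dataset's columns are undocumented.

```python
from collections import Counter

# The four decisions named in the dataset description.
# Assumption: each row has exactly one expected decision using these strings.
ACTIONS = ("approve", "reject", "repair", "critique")

def score_decisions(expected, predicted):
    """Return overall accuracy and per-action recall for paired label lists."""
    if len(expected) != len(predicted):
        raise ValueError("expected and predicted must have the same length")
    total = Counter(expected)
    correct = Counter(e for e, p in zip(expected, predicted) if e == p)
    accuracy = sum(correct.values()) / len(expected) if expected else 0.0
    # Recall is only reported for actions that occur in the expected labels.
    recall = {a: correct[a] / total[a] for a in ACTIONS if total[a]}
    return accuracy, recall

# Hypothetical labels for illustration; not taken from the dataset.
expected = ["approve", "reject", "reject", "repair"]
predicted = ["approve", "reject", "approve", "repair"]
accuracy, recall = score_decisions(expected, predicted)
print(accuracy, recall)
```

Per-action recall is reported alongside accuracy because a wrongly approved plan is likely the costliest error in a pre-execution safety check, and overall accuracy alone can hide it.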

Strengths

Public seed release for the first official CodePit model track, CodePit PlanGuard.
Designed to test multiple safety functions: critique, repair, rejection, and approval.

Limitations

Row count is unknown, so it is unclear whether the dataset is large enough for a given benchmarking or training use.
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.
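Because field semantics must be inferred after download, a first inspection pass might look like the sketch below. It assumes the rows have already been loaded into a list of dictionaries by whatever means fits the actual file format, which this listing does not state. The field names in the sample rows are placeholders, not the dataset's schema.

```python
from collections import defaultdict

def profile_fields(rows):
    """Map each field name to (rows containing it, sorted value type names)."""
    counts = defaultdict(int)
    types = defaultdict(set)
    for row in rows:
        for key, value in row.items():
            counts[key] += 1
            types[key].add(type(value).__name__)
    return {key: (counts[key], sorted(types[key])) for key in counts}

# Hypothetical rows; these field names are placeholders for illustration only.
sample_rows = [
    {"user_intent": "swap 1 ETH", "plan": ["quote", "swap"]},
    {"user_intent": "bridge funds", "plan": None},
]
profile = profile_fields(sample_rows)
for name, (count, type_names) in profile.items():
    print(name, count, type_names)
```

A field that shows mixed types or appears in only some rows is a good candidate for closer manual inspection.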

Provenance

Source: CodePit
Freshness: Last updated 2026-06-02 12:16:34; freshness should be verified.

License is unknown; terms of use must be verified before application.

Quality Score

D37

Community

1 like

0 views

Dataset Info

Author: CodePit
Created: Jun 2, 2026
Updated: Jun 2, 2026
Last synced: Jun 7, 2026
