AFTER: A Benchmark for Agent Skill Evolution Frameworks | DataSalon

Biology & Ecology

Name: AFTER: A Benchmark for Agent Skill Evolution Frameworks
Creator: DavydenkoGr
Published: 2026-06-19T11:05:55
Keywords: Machine Learning, Agent Evaluation, AI Benchmark, Benchmark, Text, Skill Evolution

Available on 1 platform

Description

AFTER is a benchmark for studying skill evolution in agentic frameworks, measuring their ability to revise, specialize, and reuse skill instructions. The test split contains 129 tasks, with the full dataset scheduled for future release. It was created by DavydenkoGr and last updated on June 19, 2026.

Use Cases

Benchmarking how well agentic frameworks improve skills by revising and reusing skill instructions.
Evaluating skill transfer across roles and tasks.
Studying agent performance across different execution contexts.

Strengths

The test split contains 129 tasks available for immediate use.
The dataset is designed for a specific, measurable research goal: evaluating skill evolution frameworks.

Limitations

The full dataset is not yet available, limiting current scope.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and data scale beyond the 129-task test split are unknown, which makes suitability hard to assess.
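
Since column-level documentation is absent, one way to infer field semantics after download is to scan the records and summarize which fields appear, which value types they hold, and how often they are missing. The following is a minimal sketch in Python, assuming the records have already been loaded as a list of dictionaries. The helper `summarize_fields` and the sample field names (`task_id`, `role`, `skill`) are hypothetical placeholders for illustration and are not taken from the AFTER dataset.

```python
from collections import defaultdict


def summarize_fields(records):
    """Return {field: {"types": sorted type names, "missing": count}}."""
    types = defaultdict(set)      # field -> set of observed value type names
    present = defaultdict(int)    # field -> number of records containing it
    for record in records:
        for key, value in record.items():
            types[key].add(type(value).__name__)
            present[key] += 1
    total = len(records)
    return {
        key: {"types": sorted(types[key]), "missing": total - present[key]}
        for key in sorted(types)
    }


# Hypothetical sample rows; the real AFTER fields are undocumented.
sample = [
    {"task_id": 1, "role": "analyst", "skill": "Summarize the report."},
    {"task_id": 2, "role": "reviewer"},
]

summary = summarize_fields(sample)
for field, info in summary.items():
    print(field, info)
```

A summary like this only reveals structure (names, types, gaps); what each field means still has to be confirmed by reading a few rows by hand.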

Provenance

Source: DavydenkoGr on Hugging Face.
Freshness: Last updated 2026-06-19 11:24:43; verify against the source before use.

License is unknown; users should verify terms before use.

Quality Score

D37

Community

38 downloads

1 like

0 views

Dataset Info

Author: DavydenkoGr
Created: Jun 19, 2026
Updated: Jun 19, 2026
Last synced: Jun 25, 2026
