DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

AgentMorph Bugs V0.1: Trajectory-Level Metamorphic Testing Candidates | DataSalon

Home Media & CommunicationAgentMorph Bugs V0.1: Trajectory-Level Metamorphic Testing Candidates

Media & Communication

AgentMorph Bugs V0.1: Trajectory-Level Metamorphic Testing Candidates

Name: AgentMorph Bugs V0.1: Trajectory-Level Metamorphic Testing Candidates
Creator: Anonymous2535k
Published: 2026-05-07T04:59:25
Keywords: Bug Candidates, Llm Agents, Tool Use, Benchmark, Metamorphic Testing, Tabular

by Anonymous2535k·Updated 1mo ago

Available on 1 platform

Description

A collection of bug candidates identified through metamorphic testing of tool-using LLM agents. The dataset is an artifact for the AgentMorph paper, authored by Anonymous2535k and last updated on May 7, 2026. It contains cleaned Stage 3 bug candidates derived from mutated task trajectories.

Use Cases

Benchmarking LLM agent robustness based on trajectory-level metamorphic testing
Identifying failure patterns in tool-using agents based on invariant violations
Developing new testing methodologies for AI agents based on intent-preserving task mutations

Strengths

Focuses on a specific testing methodology (metamorphic testing) for LLM agents
Provides cleaned bug candidates from Stage 3 of the AgentMorph process
Serves as a direct artifact for a research paper, indicating a defined purpose

Limitations

Column-level documentation is absent; field semantics must be inferred after download
Row count is unknown, which may limit suitability assessment
Description metadata is limited; actual data quality requires manual inspection after download

Provenance

Source: huggingface
Collection Method: Generated via metamorphic testing of tool-using LLM agents, where tasks are mutated to preserve intent and trajectories are compared.
Freshness: Last updated 2026-05-07 06:17:51; freshness should be verified

Tabular Bug Candidates Llm Agents Tool Use Benchmark Metamorphic Testing

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

1 likes

0 views

Dataset Info

Author: Anonymous2535k
Created: May 7, 2026
Updated: May 7, 2026
Last synced: May 14, 2026

Access

Community

1 likes

0 views

Dataset Info

Author: Anonymous2535k
Created: May 7, 2026
Updated: May 7, 2026
Last synced: May 14, 2026

AgentMorph Bugs V0.1: Trajectory-Level Metamorphic Testing Candidates

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info