DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Agent Safety Bench: A Benchmark for Evaluating Tool-Using LLM Agents

Name: Agent Safety Bench: A Benchmark for Evaluating Tool-Using LLM Agents
Creator: aradhye
Published: 2026-05-03T16:28:44
Keywords: LLM Benchmark, Agent Safety, Tool Use, Text, Safety Evaluation

Available on 1 platform

Description

Agent Safety Bench (ASB) is a benchmark for evaluating the safety of tool-using LLM agents. Each example pairs a natural-language instruction with one or more sandboxed tool environments to measure whether an agent completes tasks without taking unsafe actions. The dataset was created by author 'aradhye' and was last updated on 2026-05-04.

Use Cases

Benchmarking agent safety performance based on natural-language instructions paired with tool environments.
Developing safety guardrails for LLM agents based on the benchmark's task structure.
Comparing different agent architectures on their propensity for unsafe actions in sandboxed environments.
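
A comparison along these lines could be sketched as follows. This is a minimal illustration, not the benchmark's own scoring code: the `EpisodeResult` record, its fields, and the two rates are assumptions made for the example, since the listing does not document how ASB reports results.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class EpisodeResult:
    # Hypothetical record of one agent run; not a documented ASB schema.
    agent: str
    task_completed: bool
    unsafe_action: bool


def summarize(results):
    """Return per-agent unsafe-action rate and safe-success rate."""
    counts = defaultdict(lambda: {"n": 0, "unsafe": 0, "safe_success": 0})
    for r in results:
        c = counts[r.agent]
        c["n"] += 1
        c["unsafe"] += int(r.unsafe_action)
        # A run counts as a safe success only if the task was completed
        # and no unsafe action was taken along the way.
        c["safe_success"] += int(r.task_completed and not r.unsafe_action)
    return {
        agent: {
            "unsafe_rate": c["unsafe"] / c["n"],
            "safe_success_rate": c["safe_success"] / c["n"],
        }
        for agent, c in counts.items()
    }


# Made-up results for two hypothetical agents.
results = [
    EpisodeResult("agent_a", True, False),
    EpisodeResult("agent_a", True, True),
    EpisodeResult("agent_a", False, False),
    EpisodeResult("agent_a", True, False),
    EpisodeResult("agent_b", True, True),
    EpisodeResult("agent_b", False, True),
]
summary = summarize(results)
```

Reporting both rates keeps an agent that refuses every task (never unsafe, never successful) from looking better than one that completes tasks safely.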

Strengths

The benchmark is specifically designed for the emerging domain of tool-using LLM agent safety.
It pairs natural-language instructions with sandboxed tool environments, providing a structured testbed.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is not reported, so the benchmark's size cannot be assessed before download.
Last updated 2026-05-04 03:20:13; freshness should be verified.
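
Because neither column documentation nor row count is published, a first pass after download might simply tally field names and value types. The sketch below assumes the records have already been loaded as a list of dictionaries; the field names in `sample` are hypothetical placeholders, not the dataset's actual columns, and the Hugging Face repository ID is not given in this listing.

```python
from collections import defaultdict


def infer_schema(records):
    """Map each field name to the sorted set of Python type names seen for it."""
    schema = defaultdict(set)
    for record in records:
        for key, value in record.items():
            schema[key].add(type(value).__name__)
    return {key: sorted(types) for key, types in schema.items()}


# Hypothetical records standing in for downloaded rows.
sample = [
    {"instruction": "Delete the temp files", "tools": ["shell"]},
    {"instruction": "Email the report", "tools": ["email", "files"], "label": None},
]

schema = infer_schema(sample)
row_count = len(sample)
```

Fields that appear in only some records, or with more than one type, are the ones most worth checking by hand before relying on them.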

Provenance

Source: Hugging Face
Freshness: Last updated 2026-05-04.

The runtime environments (the Python classes the agent calls into) are hosted in a separate companion package named 'agent-safety-bench-envs'.
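
Before running any evaluation, it may help to confirm that the companion distribution is present in the current Python environment. The sketch below uses only the standard library's `importlib.metadata`. The helper name `installed_version` is made up for this example, and nothing here assumes anything about the package's import name or API, which this listing does not describe.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


# Distribution name as given in the listing.
envs_version = installed_version("agent-safety-bench-envs")
if envs_version is None:
    print("Companion environments package is not installed.")
else:
    print(f"Companion environments package version: {envs_version}")
```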

Quality Score

D38

Community

82 downloads

1 like

0 views

Dataset Info

Author: aradhye
Created: May 3, 2026
Updated: May 4, 2026
Last synced: May 21, 2026
