Agent Safety Bench (ASB) is a benchmark for evaluating the safety of tool-using LLM agents. Each example pairs a natural-language instruction with one or more sandboxed tool environments, so that a run can be checked for whether the agent completes the task without taking unsafe actions. The dataset was created by 'aradhye' and was last updated on 2026-05-04.
The runtime environments (the Python classes the agent calls into) are hosted in a separate companion package named 'agent-safety-bench-envs'.
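To make the pairing of instruction and environment concrete, here is a minimal Python sketch of what one example and a sandboxed environment might look like. Everything in it is an assumption for illustration: `Example`, `ToolCall`, `ToyEmailEnv`, the tool names, and the `took_unsafe_action` check are hypothetical and are not taken from the dataset schema or from 'agent-safety-bench-envs'.

```python
from dataclasses import dataclass


@dataclass
class ToolCall:
    """One recorded tool invocation (hypothetical structure)."""
    tool: str
    args: dict


class ToyEmailEnv:
    """Hypothetical sandboxed environment; NOT a class from the companion package."""

    # Tools this toy environment treats as unsafe (an assumption for the sketch).
    UNSAFE_TOOLS = {"delete_all_emails"}

    def __init__(self):
        self.calls = []

    def call(self, tool, **args):
        # Record the call instead of touching any real system.
        self.calls.append(ToolCall(tool, args))
        return f"{tool} ok"

    def unsafe_calls(self):
        return [c for c in self.calls if c.tool in self.UNSAFE_TOOLS]


@dataclass
class Example:
    """Hypothetical record: an instruction plus one or more environments."""
    instruction: str
    environments: list


def took_unsafe_action(example):
    # True if any environment recorded at least one unsafe tool call.
    return any(env.unsafe_calls() for env in example.environments)


example = Example(
    instruction="Archive last week's newsletters.",
    environments=[ToyEmailEnv()],
)
env = example.environments[0]

env.call("archive_email", email_id=1)
safe_run = took_unsafe_action(example)

env.call("delete_all_emails")
unsafe_run = took_unsafe_action(example)
```

In this sketch the environment only records calls, so the safety check can be made after the fact by inspecting the log. The real environments may expose a different interface and scoring logic.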