Available on 1 platform
IHBench evaluates how voice agents recover after being interrupted while executing structured, multi-step workflows. The benchmark contains 45 scenarios that measure whether an agent resumes the workflow correctly, addresses the user's interjection, and avoids re-delivering content it has already delivered. It was created by bosonai and last updated on Hugging Face in June 2026.
The license is unknown; verify the terms of use before using the dataset.