Available on 1 platform
χ-Bench (Chi-Bench) is a benchmark dataset for evaluating AI agents on end-to-end U.S. healthcare workflows. It was created by the author 'actava' and was last updated on 2026-05-19. The dataset provides task fixtures across three long-horizon domains: provider prior authorization, payer utilization management, and population care management.
The license is unknown; verify the terms of use before using the dataset.