Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
ARC-Bench is a 55-topic benchmark for evaluating autonomous research agents. Each topic presents an open-ended research question with a structured briefing, spanning five distinct scientific domains. The dataset was created by AIMING-Lab-UNC and released in May 2026.
License is unknown; terms of use must be verified before application.