Available on 1 platform
CoDA-Bench is a benchmark created by RUC-DataLab to evaluate AI agents on code- and data-intensive tasks in realistic environments. Unlike benchmarks that hand agents the oracle data directly, it requires agents to discover the relevant data themselves among hundreds of semantically similar files. The dataset was last updated on June 16, 2026.
The license is unknown; users should verify the licensing terms before use.