Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Onemillion Bench is a bilingual (English/Chinese) expert-level benchmark containing 400 entries across five professional domains, released by humanlaya-data-lab in March 2026. It utilizes weighted rubric-based grading criteria to evaluate language agents on analytical reasoning and instruction following within specialized fields.
The dataset is licensed under Apache-2.0 and is organized into Hugging Face subsets by domain; all data is located within the test split.