Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
147 task cards form the metadata layer of the Agents Last Exam benchmark for evaluating computer-use agents. This version 1.0 release includes titles, prompts, taxonomy, and input-file descriptors for each task. The dataset is published by agents-last-exam on HuggingFace and was last updated on June 5, 2026.
This is a metadata-only release; the actual task input data is a separate companion dataset.