Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
DELEGATE52 is a benchmark dataset for evaluating large language models on long-horizon delegated document editing across 52 professional document domains. The dataset was developed by Microsoft to study the readiness of AI systems for delegated workflows, where knowledge workers instruct LLMs to edit documents on their behalf over long sessions. It was last updated on 2026-04-20.
License is unknown; users must verify terms before use.