Sign in to view source links and access this dataset
Description
ServiceNow's GroundCUA dataset provides real UI screenshots paired with structured annotations for building multimodal computer use agents. It covers 87 software platforms across productivity, browser, creative, communication, development, and system utility categories. The dataset was last updated on December 24, 2025.
Use Cases
Train multimodal models for GUI understanding based on real UI screenshots.
Develop agents for software automation based on structured annotations of human demonstrations.
Benchmark computer use agents across diverse software platforms mentioned in the description.
Research human-computer interaction patterns from annotated demonstration data.