Sign in to view source links and access this dataset
Description
GUIDE (GUI User Intent Detection Evaluation) is a benchmark for evaluating multimodal models on perceiving user behavior and inferring intent in open-ended GUI tasks. It consists of 67.5 hours of screen recordings from 120 novice user demonstrations with think-aloud narrations, across 10 software applications. The dataset was created by Saelyne Yang, Jaesang Yu, Yi-Hao Peng, Kevin, and others, and was last updated on Hugging Face in June 2026.
Use Cases
Benchmarking multimodal AI models on their ability to infer user intent based on screen recordings and narration.
Training models to provide real-time assistance in software applications based on observed user behavior.
Studying novice user interaction patterns across a variety of common software tools.