ScreenSpot is a benchmark for evaluating GUI grounding. It pairs more than 1,200 text instructions with screens from iOS, Android, macOS, Windows, and web environments. Researchers from Nanjing University and the Shanghai AI Laboratory created it to test large multimodal models. The dataset was last updated in April 2024.
The full dataset description and details are hosted externally on the Hugging Face dataset page. No license information is stated here.