Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Osworld G provides a benchmark for computer-use grounding through UI decomposition and synthesis, released by xlang-ai as a NeurIPS 2025 Spotlight. It facilitates the training of Large Action Models (LAMs) by generating multimodal data that pairs visual GUI elements with natural language grounding instructions.
Users should refer to the xlang-ai GitHub repository for the specific synthesis scripts and evaluation frameworks associated with the NeurIPS 2025 publication.