A reprocessed annotation release for WildGUI, a dataset built from large-scale Internet tutorial videos for GUI agent pretraining. The records were regenerated, cleaned, and reformatted to make the data easier to inspect, reuse, and reproduce. The dataset was last updated on June 12, 2026.
Use Cases
- Pretraining GUI automation agents on annotated tutorial video data.
- Benchmarking GUI task performance against the cleaned and reformatted annotations.
- Studying human-computer interaction patterns in large-scale video-derived records.
Strengths
- Annotations were regenerated and cleaned following a full annotation workflow.
- Data was reformatted to improve inspectability, reusability, and reproducibility.
Limitations
- The dataset description is sparse; data quality can only be judged by inspecting the files after download.
- Column-level documentation is absent; field semantics must be inferred after download.
- The row count is not documented, which makes it hard to assess suitability before download.
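Because the row count and field semantics have to be worked out after download, a first inspection pass might look like the sketch below. It assumes the records are stored as JSON Lines (one JSON object per line), which this card does not confirm; the function name `summarize_records` and the sample field names are hypothetical and chosen only for illustration.

```python
import json
from collections import Counter, defaultdict


def summarize_records(lines):
    """Count rows and tally the value types seen for each top-level field.

    Assumes every non-blank line is a JSON object (JSON Lines layout).
    """
    row_count = 0
    field_types = defaultdict(Counter)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # ignore blank lines
        record = json.loads(line)
        row_count += 1
        for key, value in record.items():
            # Record the Python type name of each value, e.g. "str" or "int".
            field_types[key][type(value).__name__] += 1
    return row_count, {key: dict(counts) for key, counts in field_types.items()}


# Made-up records for demonstration; these are NOT the real WildGUI fields.
sample = [
    '{"video_id": "a1", "action": "click", "x": 10}',
    '{"video_id": "b2", "action": "type", "text": "hello"}',
]
rows, fields = summarize_records(sample)
print(rows)
print(fields)
```

For a downloaded file, the same function can be given an open text file handle in place of the sample list, since it only iterates over lines. Fields that appear in fewer rows than the total, or with mixed types, are good candidates for closer manual inspection.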
Provenance
- Source: Video2GUI project
- Collection Method: Built from large-scale Internet tutorial videos.
- Freshness: Last updated 2026-06-12 19:14:59.