Computer Use Large is a massive dataset containing 48,478 screen recording videos totaling approximately 12,300 hours of professional software usage. Sourced from the internet, the dataset is designed to facilitate research into GUI interaction and computer-use automation. It covers a variety of complex software environments including CAD tools, spreadsheets, and development environments.
Use Cases
- Training AI agents for GUI automation and computer use
- Action recognition in desktop software environments
- Software tutorial classification
- Developing computer vision models for interface navigation
Strengths
- Large-scale coverage with over 12,300 hours of footage
- Pre-processed to remove irrelevant content like intros and talking heads
- Diverse representation of professional software suites
Limitations
- Audio has been completely stripped from all recordings
- Specific metadata and column structures are not provided
Provenance
- Source
- markov-ai
- Collection Method
- Sourced from the internet with automated trimming and audio removal.
- Freshness
- Last updated March 12, 2026.
- Geography
- United States