ReCAP-187K-SFT contains supervised fine-tuning data for training multimodal GUI agents to solve CAPTCHAs. The dataset is structured in Qwen3-style conversation format and includes references to screenshot images from interaction trajectories. It was created by ReCAP-Agent and last updated in March 2026.
Image data is stored as split archive shards (archives/*.tar.gz.part-*) that must be downloaded and reassembled into complete archives before use; see the full dataset page for usage instructions.
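As a rough illustration of the assembly step, the sketch below joins the parts of each archive and unpacks the result. It rests on assumptions the summary above does not confirm: that the shards are plain byte-wise splits (as produced by a tool like `split`), that their `.part-` suffixes sort lexicographically in the correct order, and that the shards have already been downloaded to a local directory. The function names `group_parts`, `assemble`, and `assemble_and_extract` are hypothetical helpers, not part of the dataset's tooling.

```python
import shutil
import tarfile
from collections import defaultdict
from pathlib import Path


def group_parts(archive_dir):
    """Group shard files by the archive they belong to, in sorted order."""
    groups = defaultdict(list)
    for part in Path(archive_dir).glob("*.tar.gz.part-*"):
        # "name.tar.gz.part-00" -> "name.tar.gz"
        base = part.name.rsplit(".part-", 1)[0]
        groups[base].append(part)
    # Assumption: suffixes are zero-padded or alphabetic, so a plain
    # lexicographic sort reproduces the original split order.
    return {base: sorted(parts) for base, parts in groups.items()}


def assemble(parts, out_path):
    """Concatenate the parts byte-for-byte into one archive file."""
    with open(out_path, "wb") as out:
        for part in parts:
            with open(part, "rb") as src:
                shutil.copyfileobj(src, out)
    return out_path


def assemble_and_extract(archive_dir, dest_dir):
    """Rebuild every sharded archive in archive_dir and unpack it into dest_dir."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    for base, parts in group_parts(archive_dir).items():
        archive = assemble(parts, dest / base)
        # Only extract archives from a source you trust.
        with tarfile.open(archive, "r:gz") as tar:
            tar.extractall(dest)
```

A call such as `assemble_and_extract("archives", "images")` would then leave the rebuilt archives and their extracted contents under `images/`. If the full dataset page describes a different procedure or part-naming scheme, that procedure takes precedence over this sketch.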