Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
BrowseComp-V3 is a benchmark dataset containing 300 samples for evaluating multimodal browsing agents. It includes encrypted question-answer pairs, images, search trajectories, and sub-goals. The dataset was created by Halcyon-Zhang and last updated on February 13, —.
Data is encrypted; requires running the provided decryption scripts before use.