DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

BrowseComp-V3: A Benchmark for Multimodal Browsing Agents with 300 Samples | DataSalon

Home Multimodal & LLMBrowseComp-V3: A Benchmark for Multimodal Browsing Agents with 300 Samples

Multimodal & LLM

BrowseComp-V3: A Benchmark for Multimodal Browsing Agents with 300 Samples

Name: BrowseComp-V3: A Benchmark for Multimodal Browsing Agents with 300 Samples
Creator: Halcyon-Zhang
Published: 2026-02-13T16:20:11
Keywords: Librarypolars, Encrypted Data, Size Categoriesn1 K, Modalitytext, Librarymlcroissant, Modalityimage, Librarydatasets, Benchmark, Librarypandas, Regionus, JSON, Multimodal Benchmark, Browsing Agents, Search Trajectories, Multimodal

by Halcyon-Zhang·Updated 5mo ago

Available on 1 platform

Description

BrowseComp-V3 is a benchmark dataset containing 300 samples for evaluating multimodal browsing agents. It includes encrypted question-answer pairs, images, search trajectories, and sub-goals. The dataset was created by Halcyon-Zhang and last updated on February 13, —.

Use Cases

Benchmarking agent performance based on encrypted question-answer pairs
Training multimodal models using the provided image data
Analyzing web navigation strategies based on search trajectories
Developing task decomposition methods using sub-goals

Strengths

Contains 300 samples with multimodal components
Includes search trajectories and sub-goals for agent analysis
Provides decryption scripts for accessing the encrypted data

Limitations

Row count is unknown, which may limit suitability assessment
Column-level documentation is absent; field semantics must be inferred after download

Provenance

Source: Halcyon-Zhang
Collection Method: Likely curated for benchmarking purposes; specific collection method is not detailed.
Time Range: null
Freshness: Last updated 2026-02-13 16:22:12
Geography: null

Data is encrypted; requires running the provided decryption scripts before use.

Multimodal JSON Librarypolars Encrypted Data Size Categoriesn1 K Modalitytext Librarymlcroissant Modalityimage Librarydatasets Benchmark Librarypandas Regionus Multimodal Benchmark Browsing Agents Search Trajectories

Related Datasets

Quality Score

C41

Description

Source

Reputation

Quality Score

C41

Description

Source

Reputation

Access

Community

484 downloads

4 likes

0 views

Dataset Info

Author: Halcyon-Zhang
Created: Feb 13, 2026
Updated: Feb 13, 2026
Last synced: Apr 9, 2026

Access

Community

484 downloads

4 likes

0 views

Dataset Info

Author: Halcyon-Zhang
Created: Feb 13, 2026
Updated: Feb 13, 2026
Last synced: Apr 9, 2026

BrowseComp-V3: A Benchmark for Multimodal Browsing Agents with 300 Samples

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info