Ko-WideSearch: Korean Web Agent Benchmark Questions | DataSalon

Machine Learning

Name: Ko-WideSearch: Korean Web Agent Benchmark Questions
Creator: Minbyul
Published: 2026-06-22T04:57:17
Keywords: Korean Language, Benchmark, Question Answering, Text, Table Filling, Web Agent Benchmark

Available on 1 platform

Description

228 Korean-language questions for benchmarking web agents on exhaustive enumeration tasks. Each task asks an agent to enumerate every member of a closed set and fill every attribute cell of the resulting table. Gold answers, source URLs, and scoring details are withheld so that evaluation can be run privately against held-out data, which limits leakage.
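
To make the table-filling objective concrete, here is a minimal sketch of how an agent's answer table might be represented and checked for completeness. The table shape, the entity and column names, and the `cell_fill_rate` helper are all hypothetical; the public release does not document its fields, and this is not the benchmark's withheld scoring.

```python
from typing import Dict, List, Optional

# Hypothetical shape: an agent's answer table maps each enumerated entity
# to its attribute values. None marks a cell the agent left unfilled.
Table = Dict[str, Dict[str, Optional[str]]]


def cell_fill_rate(table: Table, columns: List[str]) -> float:
    """Fraction of (entity, column) cells that hold a non-empty value.

    This measures completeness of the agent's own output only. It says
    nothing about correctness, and it is not the benchmark's scoring.
    """
    total = len(table) * len(columns)
    if total == 0:
        return 0.0
    filled = sum(
        1
        for row in table.values()
        for col in columns
        if (row.get(col) or "").strip()
    )
    return filled / total


# Invented placeholder data, not taken from the dataset.
example: Table = {
    "entity_a": {"founded": "1990", "location": "Seoul"},
    "entity_b": {"founded": None, "location": "Busan"},
}
rate = cell_fill_rate(example, ["founded", "location"])
```

A fill rate below 1.0 only flags cells the agent skipped; whether the enumerated set itself is complete cannot be checked without the withheld gold answers.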

Use Cases

Benchmarking Korean web agents on exhaustive enumeration tasks with a table-filling objective.
Testing information extraction systems on structured knowledge completion over closed sets.
Evaluating the breadth-search capabilities of AI models, since the benchmark requires exhaustive attribute filling.

Strengths

Contains 228 specific benchmark questions.
Designed for leakage-aware evaluation with privately held gold answers.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is not reported for the released files (the description states 228 questions), which may limit suitability assessment.
Gold answers and source URLs are withheld, preventing direct validation of model outputs.
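
Because column documentation and the row count are missing, a first step after download is to inspect the records directly. The sketch below assumes the data can be read as JSON Lines, which the listing does not confirm; the field names in the placeholder lines are invented for illustration and are not the dataset's schema.

```python
import json
from collections import defaultdict
from typing import Any, Dict, Iterable, List


def summarize_fields(records: Iterable[Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Collect, per field, the observed value types, a count, and one sample."""
    summary: Dict[str, Dict[str, Any]] = defaultdict(
        lambda: {"types": set(), "count": 0, "sample": None}
    )
    for record in records:
        for key, value in record.items():
            info = summary[key]
            info["types"].add(type(value).__name__)
            info["count"] += 1
            # Keep the first non-null value as a representative sample.
            if info["sample"] is None and value is not None:
                info["sample"] = value
    return dict(summary)


# Placeholder lines standing in for a downloaded JSON Lines file; the field
# names here are invented and are not the dataset's actual schema.
lines = [
    '{"id": 1, "question": "placeholder text", "columns": ["a", "b"]}',
    '{"id": 2, "question": "another placeholder", "columns": ["c"]}',
]
records: List[Dict[str, Any]] = [json.loads(line) for line in lines]
fields = summarize_fields(records)
row_count = len(records)
```

Comparing `row_count` against the 228 questions stated in the description is a quick sanity check on the download.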

Provenance

Source: Hugging Face
Freshness: Last updated 2026-06-22 05:02:04.

Gold answers, source URLs, set sizes, and the scoring pipeline are not part of the public release.

Quality Score

D37

Community

1 like

0 views

Dataset Info

Author: Minbyul
Created: Jun 22, 2026
Updated: Jun 22, 2026
Last synced: Jun 29, 2026
