DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

CoDA-Bench: Code and Data-Intensive AI Agent Benchmark | DataSalon

Home NLP & TextCoDA-Bench: Code and Data-Intensive AI Agent Benchmark

NLP & Text

CoDA-Bench: Code and Data-Intensive AI Agent Benchmark

Name: CoDA-Bench: Code and Data-Intensive AI Agent Benchmark
Creator: RUC-DataLab
Published: 2026-05-29T04:41:01
Keywords: Evaluation, Benchmark, Ai Agents, Code Intelligence, Text, Data Intensive

by RUC-DataLab·Updated 12d ago

Available on 1 platform

Description

CoDA-Bench is a benchmark created by RUC-DataLab to evaluate AI agents on code and data-intensive tasks in realistic environments. Unlike benchmarks providing oracle data directly, it requires agents to discover relevant data among hundreds of semantically similar files. The dataset was last updated on June 16, 2026.

Use Cases

Benchmarking AI agent performance on data discovery tasks based on the requirement to find relevant files among hundreds of semantically similar ones.
Evaluating code intelligence in realistic data-intensive environments based on the benchmark's joint focus on code and data intelligence.
Training or fine-tuning code-capable AI models on tasks that integrate data retrieval and code generation.

Strengths

Designed as the first benchmark to jointly evaluate code intelligence and data intelligence of AI agents.
Simulates realistic data-intensive environments requiring agents to discover data rather than using oracle data.

Limitations

Description metadata is limited; actual data quality requires manual inspection after download.
Column-level documentation is absent; field semantics must be inferred after download.
Row count and file formats are unknown, which may limit suitability assessment.

Provenance

Source: RUC-DataLab
Freshness: Last updated 2026-06-16 03:08:58

License is unknown; users should verify licensing terms before use.

Text Evaluation Benchmark Ai Agents Code Intelligence Data Intensive

Related Datasets

Quality Score

D38

Description

Source

Reputation

Quality Score

D38

Description

Source

Reputation

Access

Community

119 downloads

1 likes

0 views

Dataset Info

Author: RUC-DataLab
Created: May 29, 2026
Updated: Jun 16, 2026
Last synced: Jun 22, 2026

Access

Community

119 downloads

1 likes

0 views

Dataset Info

Author: RUC-DataLab
Created: May 29, 2026
Updated: Jun 16, 2026
Last synced: Jun 22, 2026

CoDA-Bench: Code and Data-Intensive AI Agent Benchmark

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info