DataSalon

Eval Cards Backend: Pre-Computed AI Model Evaluation Data | DataSalon

NLP & Text

Name: Eval Cards Backend: Pre-Computed AI Model Evaluation Data
Creator: evaleval
Published: 2026-04-08T14:44:09
Keywords: Machine Learning, Benchmark Evaluation, Model Performance, Benchmark, Ai Assessment, Tabular, Synthetic

Available on 1 platform

Description

The Eval Cards Backend Dataset contains pre-computed evaluation data for 5,678 models across 798 benchmarks. Generated by the eval-cards backend pipeline, it powers the Eval Cards frontend and includes 1,321 metric-level evaluations. The dataset was last generated on May 5, 2026.

Use Cases

Compare model performance across the 798 covered benchmarks.
Analyze metric-level results for specific models using the 1,321 metric-level evaluations.
Track evaluation trends over time based on the pipeline-generated data.
Validate new model outputs against established benchmark results.
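
The comparison use cases above could be sketched roughly as follows. Because column-level documentation is absent, the field names `model`, `benchmark`, and `score` and the sample rows are hypothetical placeholders, not the dataset's actual schema.

```python
from collections import defaultdict

# Hypothetical rows: the real field names are undocumented and may differ.
records = [
    {"model": "model-a", "benchmark": "bench-1", "score": 0.71},
    {"model": "model-a", "benchmark": "bench-2", "score": 0.64},
    {"model": "model-b", "benchmark": "bench-1", "score": 0.78},
]


def pivot_scores(rows):
    """Group scores as {model: {benchmark: score}}."""
    table = defaultdict(dict)
    for row in rows:
        table[row["model"]][row["benchmark"]] = row["score"]
    return dict(table)


def best_model(table, benchmark):
    """Return the highest-scoring model on one benchmark, or None if no model has a result."""
    scored = {m: s[benchmark] for m, s in table.items() if benchmark in s}
    return max(scored, key=scored.get) if scored else None


table = pivot_scores(records)
print(best_model(table, "bench-1"))
```

Models without a result on the requested benchmark are skipped rather than treated as zero, which seems the safer default when coverage across benchmarks is uneven.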

Strengths

Includes evaluations for 5,678 distinct models.
Covers 798 different benchmarks.
Provides 1,321 metric-level evaluation records.
Contains metadata for 240 benchmark cards.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is unknown, which may limit suitability assessment.
Last updated 2026-06-03 09:29:44, but the data itself was last generated on 2026-05-05; freshness should be verified.
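
Since field semantics must be inferred after download, one possible first step is to list which fields appear and which value types they carry. This is a minimal sketch: `infer_schema` and the sample rows are illustrative assumptions, and it presumes the records have already been loaded as a list of dictionaries.

```python
def infer_schema(rows):
    """Map each field name to the sorted list of Python type names seen for it."""
    schema = {}
    for row in rows:
        for key, value in row.items():
            schema.setdefault(key, set()).add(type(value).__name__)
    return {key: sorted(types) for key, types in schema.items()}


# Hypothetical sample; real records may use different field names.
sample = [
    {"model": "model-a", "score": 0.71},
    {"model": "model-b", "score": None, "metric": "accuracy"},
]

for field, types in infer_schema(sample).items():
    print(field, types)
```

Fields that report more than one type, or that appear in only some rows, are the ones most likely to need a closer look before analysis.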

Provenance

Source: Generated by the eval-cards backend pipeline.
Collection Method: Pre-computed evaluation data.
Freshness: Last generated 2026-05-05T11:30:42.961096Z
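
To verify freshness, the generation timestamp can be compared with the listing's update time. The sketch below uses the two timestamps shown on this page and assumes the "Last updated" value is in UTC, which the page does not state; `parse_utc` is an illustrative helper name.

```python
from datetime import datetime, timezone


def parse_utc(stamp):
    """Parse an ISO 8601 timestamp that ends in 'Z' into an aware UTC datetime."""
    return datetime.fromisoformat(stamp.replace("Z", "+00:00"))


generated = parse_utc("2026-05-05T11:30:42.961096Z")
# Assumption: the listing's "Last updated" time is UTC.
updated = datetime(2026, 6, 3, 9, 29, 44, tzinfo=timezone.utc)

gap = updated - generated
print(f"Listing updated {gap.days} full days after the data was generated")
```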

Quality Score

C42

Description

Source

Reputation

Access

Community

22.0K downloads

1 like

0 views

Dataset Info

Author: evaleval
Created: Apr 8, 2026
Updated: Jun 3, 2026
Last synced: Jun 10, 2026
