DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Neuroscience

Metacognitive: FINAL Bench Functional Reasoning

Name: Metacognitive: FINAL Bench Functional Reasoning
Creator: FINAL-Bench
Published: 2026-02-21T09:12:18
Keywords: Benchmark, Reasoning, Functional Metacognition, Declarative-Procedural Gap, Error Recovery, Self-Correction, JSON; task categories: text-generation, question-answering; modalities: text, document; libraries: datasets, pandas, polars, mlcroissant; language: en; size category: n<1K; license: apache-2.0; region: us; DOI: 10.57967/hf/7873

Available on 1 platform

Description

FINAL Bench is a functional metacognitive reasoning benchmark for Large Language Models containing fewer than 1,000 records, released by FINAL-Bench in early 2026. It shifts evaluation from final-answer accuracy to measuring an AI's ability to identify knowledge gaps and perform error recovery.

Use Cases

Evaluating LLM self-correction capabilities using error-recovery tasks
Measuring the declarative-procedural gap in reasoning models
Benchmarking functional metacognition against existing frontier models

Strengths

Targets functional metacognition and error recovery specifically
Apache-2.0 licensed for open research and commercial use
Includes a registered DOI (10.57967/hf/7873) for academic citation

Limitations

Small sample size of n<1,000 records
Lack of documented column headers in the provided metadata

Provenance

Source: FINAL-Bench (Frontier Intelligence Nexus for AGI-Level Verification)
Freshness: Last updated February 27, 2026.

The dataset is provided in JSON format and is compatible with the Hugging Face datasets library, pandas, and polars.
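
Because the listing does not document the column headers, a first step after downloading the file is usually to inspect which fields the records contain. The following is a minimal sketch of that, using only the Python standard library. The function names `load_records` and `summarize_fields` are hypothetical helpers written for this illustration, not part of any FINAL-Bench tooling, and the sketch assumes the file is either a single JSON document (an array of objects, or one object) or JSON Lines; the actual file layout is not confirmed by this page.

```python
import json
from collections import defaultdict
from pathlib import Path


def load_records(path):
    """Load records from a JSON file (hypothetical helper for illustration).

    Assumes the file holds either one JSON document (an array of objects
    or a single object) or JSON Lines (one object per line).
    """
    text = Path(path).read_text(encoding="utf-8").strip()
    if not text:
        return []
    try:
        data = json.loads(text)
    except json.JSONDecodeError:
        # Not a single JSON document: fall back to one object per line.
        return [json.loads(line) for line in text.splitlines() if line.strip()]
    return data if isinstance(data, list) else [data]


def summarize_fields(records):
    """Map each field name to the sorted Python type names observed for it."""
    seen = defaultdict(set)
    for record in records:
        for key, value in record.items():
            seen[key].add(type(value).__name__)
    return {key: sorted(types) for key, types in seen.items()}
```

The resulting list of dictionaries can then be handed to a dataframe library, for example `pandas.DataFrame(records)`, once the field names are known.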

Quality Score

D39

Community

1.4K downloads

76 likes

0 views

Dataset Info

Author: FINAL-Bench
Created: Feb 21, 2026
Updated: Feb 27, 2026
Last synced: Jul 25, 2026
