OpenVerification1: Large-Scale Dataset for LLM Output Verification

Name: OpenVerification1: Large-Scale Dataset for LLM Output Verification
Creator: ReexpressAI
Published: 2025-08-02T06:21:04
Keywords: Binary Classification, Text, Llm Verification, Large Scale, Instruction Following

by ReexpressAIUpdated 2mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

ReexpressAI created OpenVerification1, the first large-scale, open-source dataset for research on LLM output verification and uncertainty quantification. The dataset, last updated on 2026-04-25, is designed for binary classification of whether a model's response correctly addresses a given prompt or question.

Use Cases

Train binary classifiers for LLM output verification based on prompt-response pairs.
Research uncertainty quantification methods for model responses.
Benchmark the reliability of instruction-following in language models.
Develop methods to automatically detect incorrect or hallucinated model answers.

Strengths

Described as the first large-scale, open-source dataset for this specific research topic.
Provides binary labels (0/1) for classifying the correctness of model responses.

Limitations

Row count, column definitions, and file formats are unknown, limiting suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
The description is incomplete, referencing a full description on an external page.

Provenance

Source: ReexpressAI via Hugging Face.
Collection Method: Method of data gathering is not specified in the provided description.
Time Range: Temporal coverage is not specified.
Freshness: Last updated 2026-04-25 17:42:54.
Geography: Spatial coverage is not specified.

License is unknown; users must verify licensing terms before use.

Text Binary Classification Llm Verification Large Scale Instruction Following

Related Datasets

Quality Score

D37

Description

39

Source

36

Reputation

45

Access

26

Community

775 downloads

1 likes

0 views

Dataset Info

Author: ReexpressAI
Created: Aug 2, 2025
Updated: Apr 25, 2026
Last synced: May 14, 2026

Access

26

Community

775 downloads

1 likes

0 views

Dataset Info

Author: ReexpressAI
Created: Aug 2, 2025
Updated: Apr 25, 2026
Last synced: May 14, 2026

OpenVerification1: Large-Scale Dataset for LLM Output Verification

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info