FEVEROUS: 87,026 Claims Verified with Wikipedia Sentences and Tables

Name: FEVEROUS: 87,026 Claims Verified with Wikipedia Sentences and Tables
Creator: Rami Aly
Published: 2021-01-01T00:00:00
Keywords: Structured Unstructured Data, Big Data, Information Extraction, Computer Science, Wikipedia, Unstructured Data, Natural Language Processing, Fact Verification, Information Retrieval, Multimodal, Data Mining

by Rami Aly / University of Cambridge

Description

FEVEROUS contains 87,026 claims, each annotated with supporting or refuting evidence drawn from Wikipedia sentences and table cells. Each claim is labeled supports, refutes, or not enough information based on this evidence. The dataset was created by Rami Aly at the University of Cambridge for fact verification research.

Use Cases

Train fact verification models on the annotated claims and their verdict labels.
Benchmark information retrieval systems on extracting evidence from both text and tables.
Develop multimodal reasoning models that combine unstructured text with structured tabular data.
Study the challenge of verifying claims where evidence is insufficient.
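
As a rough illustration of the first and last use cases, the sketch below scores predicted verdicts against gold verdicts and reports accuracy per label, so that performance on claims with insufficient evidence can be read separately from the rest. The label strings, the function name `per_label_accuracy`, and the example lists are assumptions made for illustration; the exact spelling of the labels in the downloaded files should be checked first.

```python
from collections import Counter

# Assumed label spellings, taken from this listing's wording.
LABELS = ("supports", "refutes", "not enough information")


def per_label_accuracy(gold, predicted):
    """Return the fraction of correctly predicted claims for each gold label."""
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")
    totals = Counter(gold)
    correct = Counter(g for g, p in zip(gold, predicted) if g == p)
    # Labels that never occur in the gold data are left out of the result.
    return {label: correct[label] / totals[label] for label in LABELS if totals[label]}


# Hypothetical verdicts for four claims.
gold = ["supports", "supports", "refutes", "not enough information"]
predicted = ["supports", "refutes", "refutes", "supports"]

for label, accuracy in per_label_accuracy(gold, predicted).items():
    print(f"{label}: {accuracy:.2f}")
```

Reporting accuracy per label rather than a single overall figure keeps a model that never predicts "not enough information" from looking stronger than it is.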

Strengths

The 87,026 verified claims provide substantial scale for model training.
Evidence is drawn from both unstructured sentences and structured table cells in Wikipedia.
Each claim has a clear verdict label (supports, refutes, not enough information).

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Last update date is unknown; freshness unverified.
Data may reflect the temporal and topical bias inherent to the Wikipedia snapshot used.
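
Because column-level documentation is absent, one way to infer field semantics after download is to profile the records directly. The sketch below assumes the download is in JSON Lines form (one JSON object per line), which this listing does not confirm. The function name `profile_fields` and the sample records with their field names (`claim`, `label`, `evidence`) are hypothetical placeholders, not the dataset's documented schema.

```python
import json
from collections import Counter, defaultdict


def profile_fields(lines):
    """Map each top-level field name to counts of the Python value types seen."""
    types_by_field = defaultdict(Counter)
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        for key, value in record.items():
            types_by_field[key][type(value).__name__] += 1
    return {field: dict(counts) for field, counts in types_by_field.items()}


# Hypothetical records for illustration only; real field names may differ.
sample_lines = [
    '{"claim": "Example claim A.", "label": "SUPPORTS", "evidence": []}',
    '{"claim": "Example claim B.", "label": "REFUTES"}',
]

profile = profile_fields(sample_lines)
for field, counts in sorted(profile.items()):
    print(field, counts)
```

The function accepts any iterable of lines, so an open file handle can be passed in place of `sample_lines`. A field whose count is lower than the number of records, as with `evidence` in this sample, is missing from some records and deserves a closer look.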

Provenance

Source: University of Cambridge
Collection Method: Claims were annotated with evidence from Wikipedia.

Quality Score: D38

Description: 27
Source: 66
Reputation: 18
Access: 22

Community

0 views

Dataset Info

Author: Rami Aly
Org: University of Cambridge
Created: Jan 1, 2021
DOI
