Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A dataset containing ground-truth faithfulness labels for chains of thought (CoTs), used for evaluating CoT faithfulness metrics. The dataset was created by author yoavgurarieh and last updated on May 16, 2026. It is constructed from tasks whose outputs reveal which intermediate computations must have produced them, then labeling CoTs against those computations.
License is unknown; restrictions should be verified before use.