Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
UVSKKR's dataset provides evaluation artifacts for the ICML 2026 Workshop paper "When the Chain of Thought Knows Better: Failure Modes in Multi-Turn Reasoning Models". It offers a granular, turn-level diagnostic of how distilled reasoning models behave under prolonged adversarial pressure. The dataset was last updated on June 4, 2026.
The full dataset description is hosted on an external page; complete metadata requires visiting the provided URL.