HLE-Verified is a lightweight, evaluation-ready reformatting of the benchmark created by the Skylenage Team. The original work, 'HLE-Verified: A Systematic Verification and Structured Revision of Humanity's Last Exam', was authored by Weiqi Zhai et al. and is associated with arXiv paper 2602.13964. The dataset was last updated on February 28.
License is unknown; users must verify the license terms from the original source or dataset page before use.