Humanity's Last Exam (HLE) is a multi-modal benchmark of 2,500 questions spanning dozens of academic subjects, released by the Center for AI Safety and Scale AI in January 2025. It is a frontier-level evaluation suite that uses closed-ended questions to test AI systems at the limits of human expert knowledge.
To protect benchmark integrity, the authors explicitly request that users not publicly share, re-upload, or distribute the dataset. The dataset is provided in Parquet format.
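Because the data ships as Parquet, a local copy can be inspected with pandas. The following is a minimal sketch, not an official loader: the file name `hle.parquet` and the column name `category` are hypothetical placeholders, and `pandas.read_parquet` assumes a Parquet engine such as `pyarrow` is installed. The file is read from local disk only, in keeping with the authors' request not to redistribute it.

```python
from pathlib import Path

import pandas as pd


def load_questions(parquet_path):
    """Read a locally stored Parquet file into a DataFrame."""
    # pandas.read_parquet requires a Parquet engine (e.g. pyarrow) to be installed.
    return pd.read_parquet(Path(parquet_path))


def count_by(frame, column):
    """Return the number of rows per distinct value in `column`, largest first."""
    if column not in frame.columns:
        raise KeyError(f"column {column!r} not found; available: {list(frame.columns)}")
    return frame[column].value_counts()


if __name__ == "__main__":
    # Hypothetical local path; replace with wherever your copy is stored.
    local_file = Path("hle.parquet")
    if local_file.exists():
        questions = load_questions(local_file)
        print(f"{len(questions)} rows, columns: {list(questions.columns)}")
        # "category" is an assumed column name; check the printed columns first.
        if "category" in questions.columns:
            print(count_by(questions, "category"))
    else:
        print(f"No file at {local_file}; nothing to load.")
```

Printing the column list before grouping avoids relying on any particular schema, since the exact field names are not stated here.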