Available on 1 platform
SGI-Bench is a benchmark for evaluating Scientific General Intelligence in large language models across the full inquiry cycle. It spans 10 scientific disciplines and contains more than 1,000 expert-curated samples inspired by Science's 125 Big Questions. The dataset was created by InternScience and last updated in June 2026.
The license is unknown; verify the terms of use before using the dataset.