Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
SGI-Bench is a benchmark for evaluating Scientific General Intelligence (SGI) of LLMs across the full inquiry cycle. It spans 10 scientific disciplines and contains more than 1,000 expert-curated samples inspired by Science's 125 Big Questions. The dataset was created by InternScience and was last updated on June 2, 2026.
License is unknown; users should verify terms before use.