Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
SGI-Bench is a benchmark for evaluating Scientific General Intelligence in large language models. It contains more than 1,000 expert-curated samples spanning 10 scientific disciplines, aligned with the full inquiry cycle. The dataset was created by InternScience and last updated on 2026-06-02.
License is unknown; users should verify terms before use.