Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
An evaluation dataset probing 18 Knowledge Graph-style reasoning tasks on the Qwen/Qwen3.5-2B-Base model. It was created by chayma-rhaiem and last updated on March 8, 2026. The dataset tests the model in its raw base form across parametric memory, standard grounded reasoning, and advanced grounded reasoning tasks.
License is unknown; terms of use must be verified before application.