Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
evalixa-ai's TBench Project Dataset contains adversarial benchmark tasks designed to evaluate large language models. The dataset was last updated on 2026-05-01. It is hosted on Hugging Face and focuses on reasoning, optimization, and code-related challenges.
License is unknown; users must verify permissions before use.