Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A curated analysis artifact comparing inference runtimes for the Qwen3.6-35B-A3B model. It contains results from experiments on Windows CUDA, comparing clean MTP llama.cpp, QuinsZouls llama-next TurboQuant, and Atomic TurboQuant runs under a fixed 64k context. The dataset was created by sjakek and last updated on 2026-05-15.
License is unknown; terms of use must be verified before application.