ShahzebKhoso's repository hosts raw evaluation metrics and execution telemetry logs from running the Mostly Basic Python Problems (MBPP) benchmark against the 3B-parameter Qwen 2.5 Coder model. The evaluation is designed to chart the transition between very lightweight edge models and larger desktop-class variants. The repository was last updated on 2026-05-23.
The license is unknown; verify the terms of use before using the data.