Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
This text dataset documents 10 failure cases and 32 stress-test prompts for the Qwen3.5-2B-Base model, authored by Elshawaf1 in March 2026. It maps specific model blind spots to identify training opportunities for future fine-tuning.
Experiments were conducted in Google Colab; see the Hugging Face dataset page for full reproduction details and MIT license terms.