Available on 1 platform
The PhishNChips benchmark comprises 2,000 email stimuli and 220,000 adjudicated model evaluations. It is designed to evaluate how system prompt configurations influence the security behavior of LLM-based email agents. The canonical v5.2 release was authored by AreLit and last updated on April 8, 2026.
License is unknown; terms of use should be verified before download.