DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

PhishNChips: 2,000 Email Stimuli for LLM Agent Security Benchmarking | DataSalon

Home Software Engineering & SecurityPhishNChips: 2,000 Email Stimuli for LLM Agent Security Benchmarking

Software Engineering & Security

PhishNChips: 2,000 Email Stimuli for LLM Agent Security Benchmarking

Name: PhishNChips: 2,000 Email Stimuli for LLM Agent Security Benchmarking
Creator: AreLit
Published: 2026-04-04T17:40:58
Keywords: Llm Benchmark, Email Security, Benchmark, Text, Llm Security, Large Scale, Phishing Detection, System Prompt

by AreLit·Updated 1mo ago

Available on 1 platform

Description

2,000 email stimuli and 220,000 adjudicated model evaluations comprise the PhishNChips benchmark for evaluating how system prompt configurations influence the security behavior of LLM-based email agents. The canonical v5.2 release was authored by AreLit and last updated on April 8, 2026.

Use Cases

Benchmarking LLM phishing detection performance based on system prompt variations
Analyzing the relationship between prompt configuration and agent security behavior based on the described stimuli
Training or fine-tuning email security models using the adjudicated evaluation data

Strengths

Large-scale benchmark with 2,000 distinct email stimuli
Includes 220,000 adjudicated model evaluations for analysis

Limitations

Column-level documentation is absent; field semantics must be inferred after download
Row count for the primary stimuli is known, but other dataset dimensions like file formats are unknown

Provenance

Source: AreLit on Hugging Face
Collection Method: Likely constructed for benchmarking purposes; specific collection method not detailed.
Time Range: null
Freshness: Last updated 2026-04-08 18:04:40
Geography: null

License is unknown; terms of use should be verified before download.

Text Llm Benchmark Email Security Benchmark Llm Security Large Scale Phishing Detection System Prompt

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

28 downloads

1 likes

0 views

Dataset Info

Author: AreLit
Created: Apr 4, 2026
Updated: Apr 8, 2026
Last synced: Apr 14, 2026

Access

Community

28 downloads

1 likes

0 views

Dataset Info

Author: AreLit
Created: Apr 4, 2026
Updated: Apr 8, 2026
Last synced: Apr 14, 2026

PhishNChips: 2,000 Email Stimuli for LLM Agent Security Benchmarking

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info