Available on 1 platform
Scale AI compiled 2,912 successful jailbreak prompts across 537 multi-turn conversations for the paper 'LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks'. The dataset also includes metadata, such as red teamers' comments on their design choices and the resulting attack success rate (ASR). It was last updated on the platform in September 2024.
The license is listed as CC BY-NC 4.0 (Creative Commons Attribution-NonCommercial 4.0), which restricts commercial use.