Available on 1 platform
A collection of 269,453 adversarial prompt and model rollout pairs harvested for training jailbreak-resistant alignment models. The dataset was created by memo-ozdincer and last updated on 2026-05-04. It contains approximately 225,000 harmful-side records and 44,000 benign-side over-refusal probes.
The license is unknown; users should verify the terms of use before downloading.