Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
PoFBench is a test-only benchmark designed to measure performance of policy-based custom filters in LLM-powered systems. It contains harmful content for AI filter evaluation, including biased expressions, crime-related scenarios, and jailbreak attempts. The dataset was created by SamsungSDS-Research and was last updated on 2026-05-06.
Dataset contains harmful content for AI filter evaluation; users should be aware of the content warning.