Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
43 documented safety filter test incidents from Grok's image generation system on X, showing a 100% failure rate. The dataset records prompt text, output description, failure category, and date for incidents involving photorealistic images of minors, nonconsensual imagery, and political disinformation. It serves as the evidentiary basis for a 2026 academic audit paper by Dana Lefkowitz.
Dataset is very small (56.8 KB). Primary use is for audit analysis and AI safety research, not for training large-scale ML models.