RealToxicityPrompts contains 100,000 English sentence snippets extracted from the web by the Allen Institute for AI in 2020. It was developed to provide a standardized benchmark for researchers to quantify and mitigate the risk of neural toxic degeneration in large language models.
This dataset includes offensive content by design. It is released under the Apache 2.0 license and is intended for research use only.