Available on 1 platform
This dataset contains 200,000 text documents from The Pile, each scored for toxicity using the Perspective API in May 2022. It is balanced between two halves: 100,000 of the most toxic documents and 100,000 randomly sampled documents.
The full description and specific column details are available only on the Hugging Face dataset page. The license is unknown.
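
The balanced composition described above (a top-scoring half plus a random half) can be illustrated with a minimal sketch. This is not the dataset authors' code: the function name `build_balanced_sample`, the `(doc_id, toxicity_score)` tuple layout, and the choice to draw the random half from documents outside the toxic half are all assumptions made for illustration, and the scores here are made-up numbers rather than Perspective API output.

```python
import heapq
import random


def build_balanced_sample(scored_docs, n_toxic, n_random, seed=0):
    """Split scored documents into a most-toxic half and a random half.

    scored_docs: list of (doc_id, toxicity_score) tuples (hypothetical layout).
    """
    # Take the n_toxic highest-scoring documents, highest score first.
    most_toxic = heapq.nlargest(n_toxic, scored_docs, key=lambda d: d[1])
    toxic_ids = {doc_id for doc_id, _ in most_toxic}

    # Assumption: the random half is sampled without replacement from the
    # documents not already selected. The source does not say whether the
    # two halves were allowed to overlap.
    remaining = [d for d in scored_docs if d[0] not in toxic_ids]
    rng = random.Random(seed)
    random_half = rng.sample(remaining, n_random)

    return most_toxic, random_half


# Toy corpus: document i has the made-up score i / 10.
toy_docs = [(i, i / 10) for i in range(10)]
toxic_half, random_half = build_balanced_sample(toy_docs, n_toxic=3, n_random=3)
print([doc_id for doc_id, _ in toxic_half])
print(len(random_half))
```

With the real data the two sizes would each be 100,000. Fixing the seed keeps the random half reproducible across runs.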