A scored subset of 2.2 million documents from The Pile, processed through the Perspective API on May 18-20, 2022. Created by tomekkorbak, it provides toxicity annotations for text chunks.
Users likely need access to the original Pile dataset to obtain the document text corresponding to these scores.
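As a rough illustration of pairing scores with text, the sketch below joins score records to documents by a shared identifier. The identifier (`doc_id`), the one-score-per-document layout, and the toy data are all assumptions made for the example; the dataset's actual fields and its chunk-level layout are not described here and should be checked before adapting this.

```python
from typing import Dict, Iterable, Iterator, Tuple


def attach_text(
    scores: Dict[int, float],
    documents: Iterable[Tuple[int, str]],
) -> Iterator[Tuple[int, str, float]]:
    """Yield (doc_id, text, score) for every document that has a score.

    `doc_id` is a hypothetical join key; the real dataset may identify
    documents differently (for example, by position in a Pile shard).
    """
    for doc_id, text in documents:
        if doc_id in scores:
            yield doc_id, text, scores[doc_id]


# Toy stand-ins for the score file and the original Pile documents.
scores = {0: 0.02, 2: 0.71}
documents = [
    (0, "first document"),
    (1, "second document"),
    (2, "third document"),
]

joined = list(attach_text(scores, documents))
for doc_id, text, score in joined:
    print(doc_id, score, text)
```

Streaming over the documents and keeping only the scores in memory is a deliberate choice here, since the document collection is far larger than the score table.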