Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
Pile Uncopyrighted is a version of The Pile dataset with copyrighted content removed to respect author rights. Created by yanjx21, it was last updated on April 1, 2025. The dataset is intended for training future large language models while adhering to copyright law.
License is listed as 'other' on the platform; specific terms must be checked on the dataset page.