Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
The Symbolic Pretraining Pile (SPT) is a dataset for symbolic and formal pre-training, mid-training, and supervised fine-tuning. It is procedurally generated on CPU and can be scaled to trillion tokens, with adjustable difficulty. The dataset was created by reasoning-core and last updated on March 23, 2026.
License is unknown; terms of use must be verified before download.