Available on 1 platform
The training checkpoints and data for SLM 100M are hosted on Kaggle. The raw description suggests the dataset relates to training a language model on news and historical text. Verify the actual content, size, and structure after downloading.
The license is unknown; verify the terms before use.
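Because the content, size, and structure are unconfirmed, a quick inventory of the downloaded files is a reasonable first check. The following is a minimal sketch, assuming the dataset has already been downloaded and extracted to a local directory. The function name `summarize_directory` and the path passed on the command line are hypothetical, and nothing here reflects the dataset's real layout.

```python
from collections import defaultdict
from pathlib import Path


def summarize_directory(root):
    """Return {extension: (file_count, total_bytes)} for all files under root."""
    root = Path(root)
    counts = defaultdict(int)
    sizes = defaultdict(int)
    for path in root.rglob("*"):
        if path.is_file():
            # Group files without a suffix under an explicit label.
            ext = path.suffix.lower() or "(no extension)"
            counts[ext] += 1
            sizes[ext] += path.stat().st_size
    return {ext: (counts[ext], sizes[ext]) for ext in sorted(counts)}


if __name__ == "__main__":
    import sys

    # Assumption: the first argument is wherever the download was extracted.
    target = sys.argv[1] if len(sys.argv) > 1 else "."
    for ext, (n, total) in summarize_directory(target).items():
        print(f"{ext:>16}  {n:6d} files  {total / 1e6:10.2f} MB")
```

The per-extension totals give a rough picture of whether the download is mostly checkpoint files or text data, which can then be compared against whatever the hosting page states.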