Available on 1 platform
This dataset contains 615,000 tokens of cleaned text used to train the Crow 8B language model. It was created by Crownelius and last updated on Hugging Face in March 2026. It consists of prompt-completion pairs averaging 6.65 tokens per row.
The license is unknown, which may restrict commercial or research use.