This dataset contains 876,205 tokens of multi-domain text, prepared by the author khazarai for knowledge distillation using the Qwen3.6-plus model. It covers coding, mathematics, finance, medicine, and economics, and was last updated in April 2026.
The platform tags list the license as Apache 2.0, but the dataset's own description does not confirm this, so verify the license before use. The dataset's internal structure (columns, file formats) is not specified.