Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A Korean language dataset constructed for supervised fine-tuning (SFT) of large language models as part of a Sungkyunkwan University industry-academic cooperation project. The dataset was created by preprocessing and filtering data from sources including Stanford Alpaca and OIG-Chip2 using ChatGPT-3.5 Turbo 16k to improve naturalness. The dataset page was last updated on 2023-09-25.
License is unknown; terms of use must be verified before application.