Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
BilgeAI is a collection of Turkish text datasets for language model training, created by author vural2123 and last updated on March 28, 2026. The repository is structured into separate folders for instruction tuning and raw text pretraining. Each folder contains JSONL files with specific formats for different training tasks.
License is unknown; users must verify terms before use.