Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
A specialized parallel dataset engineered for Supervised Fine-Tuning of Large Language Models in zero-resource Turkic languages. The corpus, created by ansarzeinulla and last updated in June 2026, is designed to mitigate catastrophic forgetting during model adaptation to endangered languages. It contains high-fidelity Nogai-Russian translations of biblical text.
License is unknown, which may restrict commercial or research use.