Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
KletterMix is a large German-language text dataset released as sharded JSONL files. The full release combines deduplicated data with remaining scored examples, superseding a smaller review-time subset. It was created by AIML-TUDA and was last updated on June 4, 2026.
License is unknown, which is a critical restriction for use. Data is distributed across 125 sharded JSONL files, which may require specific handling for loading.