Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
yuhuanstudio's Wikipedia Pretrain Zh dataset provides text from the Chinese Wikipedia, converted to Simplified Chinese. The data appears to be structured as JSON objects with 'title' and 'text' fields, containing article excerpts. The dataset was last updated on 2026-06-02.
License is unknown; users must verify licensing terms before commercial use.