Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
IndustryCorpus Law is a Chinese-language text corpus focused on the legal domain, constructed by BAAI to address data quality issues in industry model training. The dataset was processed using 22 specialized data operators and was last updated in July 2024. Specific volume and size details are not provided in the available metadata.
Primary documentation is in Chinese; the dataset is specialized for the Chinese legal domain. License information is not provided.