Sign in to view source links and access this dataset
Description
Patent documents from the China National Intellectual Property Administration, including full text descriptions and claims. The dataset was uploaded by user jiaqianjing and last updated on Hugging Face in June 2023. It contains fields for patent ID, publication date, title, applicant, inventors, summary, and full text of the description and claims.
Use Cases
Patent novelty analysis based on the full text description and claims.
Trend analysis of technological innovation based on publication dates and applicant information.
Named entity recognition for extracting inventors and assignees from the text fields.
Text summarization experiments using the provided summaries and full descriptions.
Legal text mining for studying claim language and scope.
Strengths
Includes full text of patent descriptions and claims, which is a rich source for NLP.
Provides structured metadata fields like patent ID, dates, and applicant names.
Limitations
Description metadata is limited; actual data quality requires manual inspection after download.
Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Provenance
Source
China National Intellectual Property Administration (CNIPA).
Freshness
Last updated 2023-06-29 06:14:59; freshness should be verified.
Geography
China
License restrictions specify use only for research purposes, prohibiting commercial or harmful use.