DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Openclaw Zh Intents 50K: A Chinese Intent Classification Dataset | DataSalon

Home Government & LegalOpenclaw Zh Intents 50K: A Chinese Intent Classification Dataset

Government & Legal

Openclaw Zh Intents 50K: A Chinese Intent Classification Dataset

Name: Openclaw Zh Intents 50K: A Chinese Intent Classification Dataset
Creator: trytax
Published: 2026-03-13T16:42:25
Keywords: Size Categories10 Kn100 K, Intent Classification, Languagezh, Modalitytext, Chinese, Benchmark, Text Classification, Text, Chinese Nlp, Text, Regionus, Synthetic Text, Licensemit, Synthetic

by trytax·Updated 3mo ago

Available on 1 platform

Description

A dataset of 50,000 Chinese text samples for intent classification, created by author trytax. The data is synthetically generated and includes labels for intent and domain. It was last updated on March 13, 2026.

Use Cases

Train a Chinese intent classifier based on the labeled 'intent' field.
Benchmark prompt routing systems based on the labeled 'domain' field.
Test model performance on synthetic conversational data based on the 'source' field.
Evaluate classification models using the provided train/validation/test split.

Strengths

Contains 50,000 total samples.
Provides a fixed, reproducible split of 45,000 training, 2,500 validation, and 2,500 test samples.
Includes multiple annotation fields: text, intent, domain, and source.

Limitations

Data is synthetically generated, which may not reflect real-world user query distributions.
Column-level documentation is absent; field semantics must be inferred after download.

Provenance

Source: trytax on Hugging Face
Collection Method: Synthetic/rule-based generation.
Time Range: null
Freshness: Last updated 2026-03-13 16:43:27; freshness should be verified.
Geography: null

null

Text Chinese Size Categories10 Kn100 K Intent Classification Languagezh Modalitytext Benchmark Text Classification Chinese Nlp Regionus Synthetic Text Licensemit Synthetic

Related Datasets

Quality Score

D37

Description

Source

Reputation

Quality Score

D37

Description

Source

Reputation

Access

Community

30 downloads

2 likes

0 views

Dataset Info

Author: trytax
Created: Mar 13, 2026
Updated: Mar 13, 2026
Last synced: May 8, 2026

Access

Community

30 downloads

2 likes

0 views

Dataset Info

Author: trytax
Created: Mar 13, 2026
Updated: Mar 13, 2026
Last synced: May 8, 2026

Openclaw Zh Intents 50K: A Chinese Intent Classification Dataset

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info