DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

SmoothConv: Chinese Multi-Channel Conversational Speech with Expert Annotations | DataSalon

Home NLP & TextSmoothConv: Chinese Multi-Channel Conversational Speech with Expert Annotations

NLP & Text

SmoothConv: Chinese Multi-Channel Conversational Speech with Expert Annotations

Name: SmoothConv: Chinese Multi-Channel Conversational Speech with Expert Annotations
Creator: qualialabsAI
Published: 2026-05-28T13:52:27
Keywords: Conversational, Chinese, Human Annotated, Audio, Natural Language Processing, Multichannel

by qualialabsAI·Updated 2d ago

Available on 1 platform

Description

Chinese multi-channel conversational speech data with expert human annotations, developed by ASLP@NPU and QualiaLabs. It is part of the SmoothConv–DuplexConv corpus family, constructed from the same underlying conversational sources as the companion DuplexConv dataset.

Use Cases

Benchmarking speech recognition models based on high-fidelity human annotations
Training speech enhancement systems based on multi-channel conversational audio
Developing conversational AI agents based on annotated dialogue speech
Studying speaker diarization based on multi-channel conversational sources

Strengths

High-quality annotations provided by expert humans
Part of a corpus family with a companion dataset (DuplexConv) of 2,000 hours

Limitations

Description metadata is limited; actual data quality requires manual inspection after download
Column-level documentation is absent; field semantics must be inferred after download

Provenance

Source: ASLP@NPU and QualiaLabs
Collection Method: Constructed from underlying conversational sources; expert human annotations.
Freshness: Last updated 2026-06-12 04:48:12; freshness should be verified
Geography: Chinese language

Audio Chinese Conversational Human Annotated Natural Language Processing Multichannel

Related Datasets

Quality Score

C42

Description

Source

Reputation

Quality Score

C42

Description

Source

Reputation

Access

Community

4.2K downloads

9 likes

0 views

Dataset Info

Author: qualialabsAI
Created: May 28, 2026
Updated: Jun 12, 2026
Last synced: Jun 14, 2026

Access

Community

4.2K downloads

9 likes

0 views

Dataset Info

Author: qualialabsAI
Created: May 28, 2026
Updated: Jun 12, 2026
Last synced: Jun 14, 2026

SmoothConv: Chinese Multi-Channel Conversational Speech with Expert Annotations

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info