DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

Wenetspeech Wu Bench: Wu Dialect Speech Processing Benchmark | DataSalon

Home Speech & AudioWenetspeech Wu Bench: Wu Dialect Speech Processing Benchmark

Speech & Audio

Wenetspeech Wu Bench: Wu Dialect Speech Processing Benchmark

Name: Wenetspeech Wu Bench: Wu Dialect Speech Processing Benchmark
Creator: ASLP-lab
Published: 2026-01-29T15:14:00
Keywords: Benchmark, Multilingual, Speech Processing, Audio, Speech Recognition, Wu Dialect

by ASLP-lab·Updated 5mo ago

Available on 1 platform

Description

Wu dialect speech data provides a manually curated benchmark for multiple speech processing tasks. It includes 9.75 hours of Wu dialect ASR data, covering Shanghainese, Suzhounese, and Mandarin code-mixed speech. The benchmark was created by ASLP-lab and updated in February 2026.

Use Cases

Wu dialect automatic speech recognition (ASR) based on Shanghainese and Suzhounese audio
Wu-to-Mandarin automatic speech translation (AST) based on described speech translation tasks
Speaker attribute analysis based on benchmark's speaker attribute evaluation
Speech emotion recognition based on benchmark's emotion recognition evaluation
Wu dialect text-to-speech (TTS) and instruct TTS based on benchmark's TTS tasks

Strengths

First publicly available, manually curated benchmark for Wu dialect speech processing
ASR component includes 9.75 hours of audio
Benchmark covers multiple tasks: ASR, AST, speaker attributes, emotion recognition, TTS, and instruct TTS

Limitations

Description metadata is limited; actual data quality requires manual inspection after download
Column-level documentation is absent; field semantics must be inferred after download

Provenance

Source: ASLP-lab
Collection Method: Manually curated benchmark
Freshness: Last updated 2026-02-08 17:11:54; freshness should be verified
Geography: Wu dialect regions, likely including Shanghai and Suzhou

Audio Multilingual Benchmark Speech Processing Speech Recognition Wu Dialect

Related Datasets

Quality Score

C43

Description

Source

Reputation

Quality Score

C43

Description

Source

Reputation

Access

Community

404 downloads

3 likes

0 views

Dataset Info

Author: ASLP-lab
Created: Jan 29, 2026
Updated: Feb 8, 2026
Last synced: Jul 25, 2026

Access

Community

404 downloads

3 likes

0 views

Dataset Info

Author: ASLP-lab
Created: Jan 29, 2026
Updated: Feb 8, 2026
Last synced: Jul 25, 2026

Wenetspeech Wu Bench: Wu Dialect Speech Processing Benchmark

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info