DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

ProductSearch Datasets Browse Topics Rankings Community API / MCP

ResourcesDocumentation Blog Changelog Status

LegalPrivacy Policy Terms of Service Cookie Policy

PhoAudiobook: A High-Quality Vietnamese Speech Dataset for Zero-Shot TTS | DataSalon

Home Speech & AudioPhoAudiobook: A High-Quality Vietnamese Speech Dataset for Zero-Shot TTS

Speech & Audio

PhoAudiobook: A High-Quality Vietnamese Speech Dataset for Zero-Shot TTS

Name: PhoAudiobook: A High-Quality Vietnamese Speech Dataset for Zero-Shot TTS
Creator: thivux
Published: 2025-06-07T07:47:34
Keywords: Text To Speech, Speech Synthesis, Zero Shot Learning, Text, Audio, Large Scale, Vietnamese Language

by thivux·Updated 6mo ago

Available on 1 platform

Description

PhoAudiobook is a high-quality and large-scale Vietnamese speech dataset curated for zero-shot text-to-speech. The dataset construction and experimental results are detailed in the ACL 2025 paper 'Zero-Shot Text-to-Speech for Vietnamese' by Thi Vu, Linh The Nguyen, and Dat Quoc Nguyen. The dataset page was last updated on Hugging Face in January 2026.

Use Cases

Training zero-shot text-to-speech models based on the high-quality Vietnamese speech data.
Benchmarking TTS model performance for Vietnamese based on the described dataset scale and quality.
Researching cross-speaker voice synthesis based on the zero-shot learning focus mentioned in the description.

Strengths

Described as 'high-quality' in the dataset description.
Described as 'large-scale' in the dataset description.
Associated with a peer-reviewed ACL 2025 publication.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count, file formats, and license are unknown, which may limit suitability assessment.

Provenance

Source: Hugging Face dataset by author 'thivux'.
Collection Method: Curated for zero-shot text-to-speech; details in the associated ACL 2025 paper.
Time Range: null
Freshness: Last updated 2026-01-01 03:52:48; freshness should be verified.
Geography: null

null

Text Audio Text To Speech Speech Synthesis Zero Shot Learning Large Scale Vietnamese Language

Related Datasets

Quality Score

D40

Description

Source

Reputation

Quality Score

D40

Description

Source

Reputation

Access

Community

2.4K downloads

41 likes

0 views

Dataset Info

Author: thivux
Created: Jun 7, 2025
Updated: Jan 1, 2026
Last synced: Jul 8, 2026

Access

Community

2.4K downloads

41 likes

0 views

Dataset Info

Author: thivux
Created: Jun 7, 2025
Updated: Jan 1, 2026
Last synced: Jul 8, 2026

PhoAudiobook: A High-Quality Vietnamese Speech Dataset for Zero-Shot TTS

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info