Hatrang Voice 4H: Vietnamese Speech Dataset for TTS Model Fine-Tuning

Name: Hatrang Voice 4H: Vietnamese Speech Dataset for TTS Model Fine-Tuning
Creator: quocs
Published: 2026-02-10T04:36:52
Keywords: Size Categories1 Kn10 K, Text To Speech, Task Categoriestext To Speech, Speech Data, AUDIOFOLDER, Speech Synthesis, Modalitytext, Librarymlcroissant, Vietnamese, Librarydatasets, Text, Audio, Regionus, Audio Transcription, Neucodec, Task Categoriesautomatic Speech Recognition, Languagevi

by quocsUpdated 5mo ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

A Vietnamese text-to-speech dataset containing 1,805 paired audio recordings and text transcriptions for fine-tuning VieNeu-TTS models. The dataset was created by author 'quocs' and last updated on February 10, 2026. Audio files are in WAV format at 24kHz, mono, with 16-bit PCM encoding.

Use Cases

Fine-tuning Vietnamese text-to-speech models based on paired audio and text data.
Training automatic speech recognition systems for Vietnamese based on audio transcriptions.
Developing or benchmarking neural codec models for Vietnamese speech synthesis.
Creating synthetic Vietnamese speech for applications based on the described audio-text pairs.

Strengths

Contains 1,805 paired audio and text samples for training.
Audio recordings are in a consistent format: WAV, 24kHz, mono, 16-bit PCM.

Limitations

Row count is unknown, which may limit suitability assessment.
Column-level documentation is absent; field semantics must be inferred after download.
Description metadata is limited; actual data quality requires manual inspection after download.

Provenance

Source: huggingface
Collection Method: Likely contains paired audio recordings and Vietnamese text transcriptions.
Time Range: null
Freshness: Last updated 2026-02-10 05:00:45; freshness should be verified.
Geography: null

null

Text Audio AUDIOFOLDER Size Categories1 Kn10 K Text To Speech Task Categoriestext To Speech Speech Data Speech Synthesis Modalitytext Librarymlcroissant Vietnamese Librarydatasets Regionus Audio Transcription Neucodec Task Categoriesautomatic Speech Recognition Languagevi

Related Datasets

Quality Score

C41

Description

42

Source

41

Reputation

47

Access

26

Community

2.3K downloads

1 likes

0 views

Dataset Info

Author: quocs
Created: Feb 10, 2026
Updated: Feb 10, 2026
Last synced: Apr 5, 2026

Access

26

Community

2.3K downloads

1 likes

0 views

Dataset Info

Author: quocs
Created: Feb 10, 2026
Updated: Feb 10, 2026
Last synced: Apr 5, 2026

Hatrang Voice 4H: Vietnamese Speech Dataset for TTS Model Fine-Tuning

Description

Use Cases

Strengths

Limitations

Provenance

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info