CSS10: Single Speaker Speech Datasets for 10 Languages

Name: CSS10: Single Speaker Speech Datasets for 10 Languages
Creator: Kyubyong
Published: 2018-05-11T02:47:20
License: Apache-2.0
Keywords: Speech To Text, Audio

Description

Ten single-speaker speech datasets, one for each of 10 languages: German, Greek, Spanish, Finnish, French, Hungarian, Japanese, Dutch, Russian, and Chinese. Each language subset pairs audio recordings with text transcriptions, which makes it suitable for supervised speech synthesis training.

Use Cases

Train neural text-to-speech (TTS) models using the audio and transcription pairs
Evaluate multilingual speech synthesis architectures across the 10 language subsets
Benchmark acoustic modeling on single-speaker data drawn from several language families

Strengths

Covers 10 distinct languages: German, Greek, Spanish, Finnish, French, Hungarian, Japanese, Dutch, Russian, and Chinese
Features single-speaker audio recordings for each language to ensure acoustic and prosodic consistency
Includes text transcriptions mapped to audio files for supervised speech synthesis training
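
As a rough illustration of how the audio and transcription pairs might be consumed for supervised training, the sketch below parses a transcript listing into (audio path, text) records. This listing does not describe the on-disk layout, so the format used here is an assumption: one utterance per line, pipe-delimited, with a relative audio path in the first field and the text in the second. The names `Utterance` and `parse_transcript` and the sample lines are hypothetical.

```python
from dataclasses import dataclass
from pathlib import Path
from typing import Iterable, List


@dataclass
class Utterance:
    audio_path: Path  # location of the recording under the language root
    text: str         # transcription paired with that recording


def parse_transcript(
    lines: Iterable[str], root: Path, delimiter: str = "|"
) -> List[Utterance]:
    """Pair each audio file with its transcription.

    Assumed layout (not confirmed by the listing): one utterance per line,
    first field = relative audio path, second field = text. Any further
    fields are ignored.
    """
    pairs = []
    for line in lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        fields = line.split(delimiter)
        if len(fields) < 2:
            raise ValueError(f"expected at least 2 fields, got: {line!r}")
        pairs.append(Utterance(root / fields[0], fields[1]))
    return pairs


# Hypothetical sample lines standing in for one language subset.
sample = [
    "book/clip_0000.wav|Hallo Welt.",
    "",
    "book/clip_0001.wav|Guten Morgen.",
]
pairs = parse_transcript(sample, Path("de"))
print(len(pairs), pairs[0].audio_path.as_posix(), pairs[1].text)
```

Because each subset has a single speaker, a loader like this can feed a TTS pipeline without any per-speaker bookkeeping; only the language root changes between subsets.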

Quality Score

D23

Description: 19
Source: 19
Reputation: 18
Access: 57

Community

483 likes

0 views

Dataset Info

License: Apache-2.0
Author: Kyubyong
Created: May 11, 2018
Updated: Mar 6, 2020
Language: HTML
Last synced: Jun 14, 2026
