Whisperspeech: Semantic and Acoustic Tokens for TTS Training | DataSalon

Speech & Audio

Name: Whisperspeech: Semantic and Acoustic Tokens for TTS Training
Creator: collabora
Published: 2023-06-19T10:39:41
Keywords: Size Categories: 1K<n<10K, Task Categories: text-to-speech, Language: en, Modality: text, Library: mlcroissant, Library: datasets, Text, Region: us, License: mit

Available on 1 platform

Description

This dataset supplies semantic and acoustic tokens for the LibriLight and LibriTTS English speech corpora, formatted for training SPEAR TTS-like models. It features 24kHz EnCodec acoustic tokens at 6kbps and semantic tokens generated through a Whisper tiny VQ bottleneck trained on LibriLight subsets.

Use Cases

Train a text-to-speech model by mapping input text to the provided semantic tokens.
Synthesize high-fidelity audio by decoding the 24kHz EnCodec acoustic tokens.
Benchmark SPEAR TTS-like models using the pre-tokenized LibriLight small, medium, and large subsets.

Strengths

Includes 24kHz EnCodec acoustic tokens with 8 quantizers at a 6kbps bitrate.
Features semantic tokens generated using a Whisper tiny VQ bottleneck trained on LibriLight subsets.
Contains pre-processed data for the small, medium, and large subsets of the LibriLight corpus.
Provides specialized tokenized representations for the English-only LibriTTS dataset.
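
The 8-quantizer and 6kbps figures above are linked by simple arithmetic, which the sketch below works through. It assumes the published 24kHz EnCodec configuration (75 token frames per second and 1024-entry codebooks); neither value is stated in this listing, and the function names are hypothetical.

```python
import math


def encodec_bitrate_bps(n_quantizers: int,
                        codebook_size: int = 1024,
                        frame_rate_hz: int = 75) -> float:
    """Bits per second for a residual VQ token stream.

    Assumption: codebook_size=1024 and frame_rate_hz=75 follow the
    published 24kHz EnCodec configuration, not this dataset listing.
    """
    bits_per_code = math.log2(codebook_size)  # 10 bits for 1024 entries
    return n_quantizers * bits_per_code * frame_rate_hz


def acoustic_tokens_per_second(n_quantizers: int,
                               frame_rate_hz: int = 75) -> int:
    """Number of acoustic tokens emitted per second of audio."""
    return n_quantizers * frame_rate_hz


if __name__ == "__main__":
    print(encodec_bitrate_bps(8))         # 8 * 10 * 75 = 6000.0 bps, i.e. 6kbps
    print(acoustic_tokens_per_second(8))  # 8 * 75 = 600 tokens per second
```

Under these assumptions, one hour of audio corresponds to 600 * 3600 = 2,160,000 acoustic tokens, which is useful when estimating sequence lengths for training.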

Quality Score

D36

Community

153 downloads

19 likes

0 views

Dataset Info

Author: collabora
Created: Jun 19, 2023
Updated: Oct 7, 2023
Last synced: Apr 28, 2026
