TTS Pretrain 1M: 1 Million Synthetic Audio Samples Across 1000 Speakers

Name: TTS Pretrain 1M: 1 Million Synthetic Audio Samples Across 1000 Speakers
Creator: Aynursusuz
Published: 2026-02-24T23:36:21
Keywords: text-to-speech, speech synthesis, speaker cloning, synthetic, synthetic audio, audio, multimodal, Parquet, modality:audio, modality:text, size_categories:1M<n<10M, library:datasets, library:polars, library:dask, library:mlcroissant, region:us

Description

This dataset contains one million synthetic audio samples for text-to-speech (TTS) applications, spread across 1000 distinct speakers. It was created by Aynursusuz. Each speaker contributes 1000 samples derived from 100 texts and 10 voice clones, which is consistent with every text being rendered by every clone (100 × 10 = 1000). The dataset was last updated on Hugging Face on March 11, 2026.
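The counts in the description can be cross-checked with a little arithmetic. The sketch below assumes that each of a speaker's 100 texts is rendered by each of that speaker's 10 voice clones; the page does not state this pairing explicitly, so treat it as an assumption. The constant and function names are illustrative only.

```python
# Counts as stated in the dataset description.
SPEAKERS = 1000
TEXTS_PER_SPEAKER = 100
CLONES_PER_SPEAKER = 10


def samples_per_speaker(texts: int, clones: int) -> int:
    """Samples for one speaker, ASSUMING every text is rendered by every clone."""
    return texts * clones


def total_samples(speakers: int, texts: int, clones: int) -> int:
    """Total samples across all speakers under the same assumption."""
    return speakers * samples_per_speaker(texts, clones)


per_speaker = samples_per_speaker(TEXTS_PER_SPEAKER, CLONES_PER_SPEAKER)
total = total_samples(SPEAKERS, TEXTS_PER_SPEAKER, CLONES_PER_SPEAKER)
print(per_speaker, total)
```

Under that assumption the per-speaker and overall totals agree with the figures quoted on the page.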

Use Cases

Pre-training text-to-speech models on the 1 million audio samples.
Training multi-speaker TTS systems using the 1000 distinct speaker identities.
Developing voice cloning techniques using the 10 clones per speaker mentioned in the description.
Benchmarking audio generation quality against the 44.1 kHz WAV samples.

Strengths

Contains 1,000,000 total audio samples.
Includes audio from 1000 distinct speakers.
Audio is provided at a 44.1 kHz sample rate in WAV format.
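Since the stated 44.1 kHz sample rate comes from the page rather than from inspecting the files, it may be worth confirming on a few downloaded samples. The following is a minimal sketch using Python's standard `wave` module. It assumes the audio is available as raw PCM WAV bytes; how the bytes are actually stored in the Parquet files is not documented here. The helper names and the synthetic silent clip are illustrative stand-ins, not part of the dataset.

```python
import io
import wave

EXPECTED_RATE_HZ = 44_100  # sample rate stated on the dataset page


def wav_sample_rate(wav_bytes: bytes) -> int:
    """Read the sample rate from the header of an in-memory PCM WAV file."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as wav:
        return wav.getframerate()


def make_silent_wav(rate_hz: int, n_frames: int = 441) -> bytes:
    """Build a short mono 16-bit silent WAV, used only as a stand-in sample."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav.setframerate(rate_hz)
        wav.writeframes(b"\x00\x00" * n_frames)
    return buffer.getvalue()


demo = make_silent_wav(EXPECTED_RATE_HZ)
rate = wav_sample_rate(demo)
print(rate == EXPECTED_RATE_HZ)
```

In practice `demo` would be replaced by the bytes of a real sample from the dataset; a mismatch would suggest the files were resampled or that the page metadata is out of date.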

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Data may reflect synthetic generation bias inherent to the source method.
Description metadata is limited; actual data quality requires manual inspection after download.
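Because column-level documentation is absent, a first step after download is usually to list the fields and the value types they hold. The sketch below shows one way to do that over rows represented as plain dictionaries. The column names `text`, `speaker_id`, and `audio` are hypothetical placeholders, not the dataset's real schema, and the rows are made up for illustration.

```python
from collections import Counter


def summarize_fields(rows):
    """Map each field name to a Counter of the Python type names seen in it."""
    summary = {}
    for row in rows:
        for name, value in row.items():
            summary.setdefault(name, Counter())[type(value).__name__] += 1
    return summary


# Hypothetical rows standing in for records read from the Parquet files.
sample_rows = [
    {"text": "hello", "speaker_id": 0, "audio": b"RIFF"},
    {"text": "world", "speaker_id": 1, "audio": b"RIFF"},
]

summary = summarize_fields(sample_rows)
for name, types in summary.items():
    print(name, dict(types))
```

Running the same function over a small slice of the real data would give a quick picture of which fields exist and whether any of them mix types or contain missing values.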

Provenance

Source: Hugging Face, uploaded by Aynursusuz.
Collection Method: Synthetically generated for text-to-speech pretraining.
Freshness: Last updated 2026-03-11 17:24:44; freshness should be verified.

Quality Score

Overall: D (35)
Description: 33
Source: 36
Reputation: 48
Access: 22

Community

Downloads: 5.1K
Likes: 1
Views: 0

Dataset Info

Author: Aynursusuz
Created: Feb 24, 2026
Updated: Mar 11, 2026
