Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
2,967,779 clone utterances across 2,971 English speakers, generated by the echo-tts synthesizer. The dataset was created by SynDataLab and last updated on 2026-04 25. It contains WAV audio at 44.1 kHz, stored in Parquet files, with each speaker represented by 10 voice-clone latents and 100 synthesized texts.
License is unknown, which may restrict usage. A companion 'refs set' contains the first utterance of each speaker.