Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
LibriHeavy TTS 3 is an improved version of the LibriHeavy dataset, designed specifically for text-to-speech training quality. It is built on a 50,000-hour labeled ASR corpus derived from LibriLight, with audio encoded using the Opus 68kbps codec. The dataset, authored by brthor and last updated in April 2026, focuses on providing better audio and text supervision quality.
The dataset is described as a work in progress (WIP) and not released yet, meaning it may not be available for download or use.