LJ Speech contains 13,100 short audio clips of a single speaker reading from seven non-fiction books, totaling approximately 24 hours of English speech. Released by Keith Ito, the dataset provides expert-generated transcriptions for every recording to support speech synthesis and recognition tasks.
Audio is stored as .wav files rather than as pre-converted float32 arrays, so users should decode each file into an array, for example with the soundfile library, before training. The dataset is released under the Unlicense (public domain).