Loading...
Loading...
Available on 1 platform
Sign in to view source links and access this dataset
CommonVoice 22 speech data enhanced by Sidon and converted into DAC VAE latent representations. The dataset is provided by TTS-AGI and was last updated on March 22, 2026. Each sample includes original FLAC audio, a corresponding latent vector, and metadata.
License is unknown; users must verify licensing terms before use. Requires tools to handle .tar shards, .npy, and .flac files.