Sovits4.0 768Vec Layer12

Name: Sovits4.0 768Vec Layer12
Creator: ms903
Published: 2023-04-16T12:42:24
Keywords: Regionus

by ms903Updated 3y ago

Available on 1 platform

Sign in to view source links and access this dataset

Description

6 pre-trained base models for SoVITS 4.0 voice conversion, featuring 768-dimensional vectors and layer 12 configurations. These models were trained on the m4singer and vctk datasets, reaching up to 320,000 training steps with loss values as low as 14.1.

Use Cases

Fine-tune a singing voice conversion model using the m4singer-trained base weights
Initialize a SoVITS 4.0 training session by renaming the 216k step checkpoint to G_0.pth and D_0.pth
Compare voice synthesis quality between the 14.75 loss (294k steps) and 14.1 loss (144k steps) models

Strengths

Includes checkpoints at 100k, 216k, and 320k training steps
Features a high-performance model trained on NVIDIA A10 hardware with a loss of 14.1
Utilizes 768-dimensional vector embeddings from layer 12 of the feature extractor
Requires renaming files to D_0.pth and G_0.pth for integration into the SoVITS 4.0 framework

Regionus

Related Datasets

Quality Score

D29

Description

24

Source

36

Reputation

30

Access

22

Community

611 downloads

60 likes

0 views

Dataset Info

Author: ms903
Created: Apr 16, 2023
Updated: Jun 3, 2023
Last synced: Apr 28, 2026

Access

22

Community

611 downloads

60 likes

0 views

Dataset Info

Author: ms903
Created: Apr 16, 2023
Updated: Jun 3, 2023
Last synced: Apr 28, 2026

Sovits4.0 768Vec Layer12

Description

Use Cases

Strengths

Related Topics

Related Datasets

Quality Score

Community

Dataset Info

Community

Dataset Info