VITS fine-tuning
Created on June 30|Last edited on July 5
Comment
Test samples
(Samples synthesized from text without reference audio)
Runs:
- vits-ljs-freeman-angry - fine-tune all weights from LJSpeech checkpoint on angry subset of Morgan dataset
- vits-vctk-freeman-angry - fine-tuning with frozen text_encoder from VCTK checkpoint on angry subset of Morgan dataset
TestAudios/0-audio
VITS
2
Combining Morgan with VCTK
Each sample represents different "mood"
vits-vctk-freeman_x10 is continued from last checkpoint of vits-vctk-freeman with "Freeman" data upsampled to be 10 times more freequent.
Run set
2
Add a comment