--- license: cc-by-nc-4.0 language: - en --- Models trained from VITS-fast-fine-tuning (https://github.com/Plachtaa/VITS-fast-fine-tuning) - Three speakers: laoliang (老梁), specialweek, zhongli. - The model is trained on the C+J base model with 500 epochs. - Following the official instruction, we use a single long audio of laoliang (~5 minutes) with auxiliary data as training data. - After downloading models, you need to move finetune_speaker.json and G_latest.pth to /path/to/ VITS-fast-fine-tuning. - Finally, you can run your local gradio application via python VC_inference.py --model_dir ./G_latest.pth --share True