HuanLin
/

DiffSVCBaseModel

pre-trained_model

Model card Files Files and versions Community

DiffSVCBaseModel / README.md

HuanLin's picture

Update README.md

0a6bfed almost 2 years ago

|

1.67 kB

	---
	tags:
	- DiffSVC
	- pre-trained_model
	- basemodel
	- diff-svc
	license: "gpl"
	datasets:
	- 512rc_50k
	- 512rc_80k
	- 512rc_100k
	---
	English \| [简体中文](./README_CN.md)
	# DiffSVCBaseModel

	A Diff-SVC base model for all kind of voice

	## How to use?

	1. Choose and download this model

	2. Fill your config and put your datasets into ```(diffsvc-root)/data/raw/{speaker_name}/```

	3. Throw this base model(only .ckpt file) into ```(diffsvc-root)/checkpoints/{speaker_name}```

	4. Then start preprocessing and training as usual

	## How much data do you use?

	I use 2 public datasets(opencpop ,m4singer),40h+ audio in total.

	## I want to train my own base model!

	OK, you can download [this bianry file](./BaseModelBinary.tar.gz).

	## Download


	\| Version \| URL \| Reference value of lr \|
	\| -------------- \| ------------------------------------ \| --------------------- \|
	\| 384rc,50k_step \| [Click here](./384rc_50k_step.zip) \| 0.0016 \|
	\| 384rc,80k_step \| [Click here](./384rc_80k_step.zip) \| 0.0032 \|
	\| 384rc,100k_step \| [Click here](./384rc_100k_step.zip) \| 0.0032 \|

	More coming soon...

	## Repos

	\| Repo \| URL \|
	\| --------------- \| ---------------------------------------------------- \|
	\| Diff-SVC \| [Click here](https://github.com/prophesier/diff-svc) \|
	\| 44.1KHz Vocoder \| [Click here](https://openvpi.github.io/vocoders) \|
	\| M4Singer \| [Click here](https://github.com/M4Singer/M4Singer) \|
	\| OpenCPOP \| [Click here](https://github.com/wenet-e2e/opencpop) \|

	> rc: residual_channels