---
tags:
- DiffSVC
- pre-trained_model
- basemodel
- diff-svc
license: "gpl"
datasets:
- 512rc_50k
- 512rc_80k
- 512rc_100k
---
**English** | [简体中文](./README_CN.md)
# DiffSVCBaseModel
A Diff-SVC base model for all kinds of voices.
## How to use?
1. Choose a model version from the Download section and download it.
2. Fill in your config and put your dataset into `(diffsvc-root)/data/raw/{speaker_name}/`.
3. Copy the base model (the `.ckpt` file only) into `(diffsvc-root)/checkpoints/{speaker_name}`.
4. Start preprocessing and training as usual.
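
Steps 2 and 3 can be sketched as a small helper. This is only an illustration: `place_base_model` and its arguments are hypothetical names, not part of Diff-SVC, and the sketch assumes you have already unzipped the download so that a `.ckpt` file is available.

```python
import shutil
from pathlib import Path


def place_base_model(diffsvc_root, speaker_name, ckpt_file):
    """Create the speaker folders and copy the base-model checkpoint.

    Hypothetical helper: it only mirrors the folder layout described in
    the steps above and does not call any Diff-SVC code.
    """
    root = Path(diffsvc_root)
    raw_dir = root / "data" / "raw" / speaker_name   # step 2: your dataset goes here
    ckpt_dir = root / "checkpoints" / speaker_name   # step 3: the base model goes here
    raw_dir.mkdir(parents=True, exist_ok=True)
    ckpt_dir.mkdir(parents=True, exist_ok=True)

    src = Path(ckpt_file)
    if src.suffix != ".ckpt":
        # Only the .ckpt file should be copied, nothing else from the archive.
        raise ValueError(f"expected a .ckpt file, got {src.name}")

    target = ckpt_dir / src.name
    shutil.copy2(src, target)
    return raw_dir, target
```

For example, `place_base_model("path/to/diff-svc", "my_speaker", "path/to/base_model.ckpt")` (all three values are placeholders) returns the dataset folder to fill and the path of the copied checkpoint.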
## How much data do you use?
I used two public datasets (Opencpop and M4Singer), more than 40 hours of audio in total.
## I want to train my own base model!
OK, you can download [this binary file](./BaseModelBinary.tar.gz).
## Download
| Version         | URL                                 | Reference learning rate (lr) |
| --------------- | ----------------------------------- | ---------------------------- |
| 384rc,50k_step  | [Click here](./384rc_50k_step.zip)  | 0.0016                       |
| 384rc,80k_step  | [Click here](./384rc_80k_step.zip)  | 0.0032                       |
| 384rc,100k_step | [Click here](./384rc_100k_step.zip) | 0.0032                       |
More coming soon...
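
To show how a row of the Download table might translate into training settings, here is a minimal sketch. The values are copied from the table; `config_overrides` is a hypothetical helper, and treating `residual_channels` and `lr` as the literal key names in your config is an assumption you should check against your own config file.

```python
# Reference learning rates copied from the Download table.
REFERENCE_LR = {
    "384rc,50k_step": 0.0016,
    "384rc,80k_step": 0.0032,
    "384rc,100k_step": 0.0032,
}


def config_overrides(version):
    """Return settings matching one base-model version from the table.

    Hypothetical helper; the key names are assumed, not taken from Diff-SVC.
    """
    # "rc" stands for residual_channels, so "384rc,50k_step" -> 384.
    channels = int(version.split("rc")[0])
    return {"residual_channels": channels, "lr": REFERENCE_LR[version]}
```

The lr values are reference values only, so treat the result as a starting point rather than a required setting.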
## Repos
| Repo             | URL                                                  |
| ---------------- | ---------------------------------------------------- |
| Diff-SVC         | [Click here](https://github.com/prophesier/diff-svc) |
| 44.1 kHz Vocoder | [Click here](https://openvpi.github.io/vocoders)     |
| M4Singer         | [Click here](https://github.com/M4Singer/M4Singer)   |
| Opencpop         | [Click here](https://github.com/wenet-e2e/opencpop)  |
> rc: residual_channels