Spaces:
Runtime error
Runtime error
File size: 1,234 Bytes
d1b91e7 53fa903 d1b91e7 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
# Prepare Vocoder
We use [HiFi-GAN](https://github.com/jik876/hifi-gan) as the default vocoder.
## LJSpeech
### Use Pretrained Model
```bash
wget https://github.com/xx/xx/releases/download/pretrain-model/hifi_lj.zip
unzip hifi_lj.zip
mv hifi_lj checkpoints/hifi_lj
```
### Train Your Vocoder
#### Set Config Path and Experiment Name
```bash
export CONFIG_NAME=egs/datasets/audio/lj/hifigan.yaml
export MY_EXP_NAME=my_hifigan_exp
```
#### Prepare Dataset
Prepare dataset following [prepare_data.md](./prepare_data.md).
If you have run the `prepare_data` step of the acoustic
model (e.g., PortaSpeech and DiffSpeech), you only need to binarize the dataset for the vocoder training:
```bash
python data_gen/tts/runs/binarize.py --config $CONFIG_NAME
```
#### Training
```bash
CUDA_VISIBLE_DEVICES=0 python tasks/run.py --config $CONFIG_NAME --exp_name $MY_EXP_NAME --reset
```
#### Inference (Testing)
```bash
CUDA_VISIBLE_DEVICES=0 python tasks/run.py --config $PS_CONFIG --exp_name $MY_EXP_NAME --infer
```
#### Use the trained vocoder
Modify the `vocoder_ckpt` in config files of acoustic models (e.g., `egs/datasets/audio/lj/base_text2mel.yaml`) to $MY_EXP_NAME (e.g., `vocoder_ckpt: checkpoints/my_hifigan_exp`)
|