WelkinFang commited on
Commit
459ac19
1 Parent(s): 408ce0f

Update README.md of svc

Browse files

1. Fixed the provided demo commands
2. Added the guide of vocoder downloading.

Files changed (1) hide show
  1. README.md +12 -5
README.md CHANGED
@@ -33,21 +33,27 @@ git lfs install
33
  git clone https://huggingface.co/amphion/singing_voice_conversion
34
  ```
35
 
36
- ### Step2: Clone the Amphion's Source Code of GitHub
 
 
 
 
 
37
  ```bash
38
  git clone https://github.com/open-mmlab/Amphion.git
39
  ```
40
 
41
- ### Step3: Specify the checkpoint's path
42
  Use the soft link to specify the downloaded checkpoint in first step:
43
 
44
  ```bash
45
  cd Amphion
46
- mkdir ckpts/svc
47
- ln -s ../singing_voice_conversion/vocalist_l1_contentvec+whisper ckpts/svc/vocalist_l1_contentvec+whisper
 
48
  ```
49
 
50
- ### Step4: Conversion
51
 
52
  You can follow [this recipe](https://github.com/open-mmlab/Amphion/tree/main/egs/svc/MultipleContentsSVC#4-inferenceconversion) to conduct the conversion. For example, if you want to make Taylor Swift sing the songs in the `[Your Audios Folder]`, just run:
53
 
@@ -57,6 +63,7 @@ sh egs/svc/MultipleContentsSVC/run.sh --stage 3 --gpu "0" \
57
  --infer_expt_dir "ckpts/svc/vocalist_l1_contentvec+whisper" \
58
  --infer_output_dir "ckpts/svc/vocalist_l1_contentvec+whisper/result" \
59
  --infer_source_audio_dir [Your Audios Folder] \
 
60
  --infer_target_speaker "vocalist_l1_TaylorSwift" \
61
  --infer_key_shift "autoshift"
62
  ```
 
33
  git clone https://huggingface.co/amphion/singing_voice_conversion
34
  ```
35
 
36
+ ### Step2: Download the vocoder checkpoint
37
+ ```bash
38
+ git clone https://huggingface.co/amphion/BigVGAN_singing_bigdata
39
+ ```
40
+
41
+ ### Step3: Clone the Amphion's Source Code of GitHub
42
  ```bash
43
  git clone https://github.com/open-mmlab/Amphion.git
44
  ```
45
 
46
+ ### Step4: Specify the checkpoints' path
47
  Use the soft link to specify the downloaded checkpoint in first step:
48
 
49
  ```bash
50
  cd Amphion
51
+ mkdir -p ckpts/svc
52
+ ln -s "$(realpath ../singing_voice_conversion/vocalist_l1_contentvec+whisper)" ckpts/svc/vocalist_l1_contentvec+whisper
53
+ ln -s "$(realpath ../BigVGAN_singing_bigdata/bigvgan_singing)" pretrained/bigvgan_singing
54
  ```
55
 
56
+ ### Step5: Conversion
57
 
58
  You can follow [this recipe](https://github.com/open-mmlab/Amphion/tree/main/egs/svc/MultipleContentsSVC#4-inferenceconversion) to conduct the conversion. For example, if you want to make Taylor Swift sing the songs in the `[Your Audios Folder]`, just run:
59
 
 
63
  --infer_expt_dir "ckpts/svc/vocalist_l1_contentvec+whisper" \
64
  --infer_output_dir "ckpts/svc/vocalist_l1_contentvec+whisper/result" \
65
  --infer_source_audio_dir [Your Audios Folder] \
66
+ --infer_vocoder_dir "pretrained/bigvgan_singing" \
67
  --infer_target_speaker "vocalist_l1_TaylorSwift" \
68
  --infer_key_shift "autoshift"
69
  ```