File size: 5,409 Bytes
528df8b
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
<div align="center">

<h1>Retrieval-based-Voice-Conversion-WebUI</h1>
VITS ๊ธฐ๋ฐ˜์˜ ๊ฐ„๋‹จํ•˜๊ณ  ์‚ฌ์šฉํ•˜๊ธฐ ์‰ฌ์šด ์Œ์„ฑ ๋ณ€ํ™˜ ํ”„๋ ˆ์ž„์›Œํฌ.<br><br>

[![madewithlove](https://img.shields.io/badge/made_with-%E2%9D%A4-red?style=for-the-badge&labelColor=orange
)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

<img src="https://counter.seku.su/cmoe?name=rvc&theme=r34" /><br>

[![Open In Colab](https://img.shields.io/badge/Colab-F9AB00?style=for-the-badge&logo=googlecolab&color=525252)](https://colab.research.google.com/github/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/Retrieval_based_Voice_Conversion_WebUI.ipynb)
[![Licence](https://img.shields.io/github/license/RVC-Project/Retrieval-based-Voice-Conversion-WebUI?style=for-the-badge)](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/LICENSE)
[![Huggingface](https://img.shields.io/badge/๐Ÿค—%20-Spaces-yellow.svg?style=for-the-badge)](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/)

[![Discord](https://img.shields.io/badge/RVC%20Developers-Discord-7289DA?style=for-the-badge&logo=discord&logoColor=white)](https://discord.gg/HcsmBBGyVk)

</div>

---

[**์—…๋ฐ์ดํŠธ ๋กœ๊ทธ**](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/blob/main/docs/Changelog_KO.md)

[**English**](../en/README.en.md) | [**ไธญๆ–‡็ฎ€ไฝ“**](../../README.md) | [**ๆ—ฅๆœฌ่ชž**](../jp/README.ja.md) | [**ํ•œ๊ตญ์–ด**](../kr/README.ko.md) ([**้Ÿ“ๅœ‹่ชž**](../kr/README.ko.han.md)) | [**Tรผrkรงe**](../tr/README.tr.md)

> [๋ฐ๋ชจ ์˜์ƒ](https://www.bilibili.com/video/BV1pm4y1z7Gm/)์„ ํ™•์ธํ•ด ๋ณด์„ธ์š”!

> RVC๋ฅผ ํ™œ์šฉํ•œ ์‹ค์‹œ๊ฐ„ ์Œ์„ฑ๋ณ€ํ™˜: [w-okada/voice-changer](https://github.com/w-okada/voice-changer)

> ๊ธฐ๋ณธ ๋ชจ๋ธ์€ 50์‹œ๊ฐ„ ๊ฐ€๋Ÿ‰์˜ ๊ณ ํ€„๋ฆฌํ‹ฐ ์˜คํ”ˆ ์†Œ์Šค VCTK ๋ฐ์ดํ„ฐ์…‹์„ ์‚ฌ์šฉํ•˜์˜€์œผ๋ฏ€๋กœ, ์ €์ž‘๊ถŒ์ƒ์˜ ์—ผ๋ ค๊ฐ€ ์—†์œผ๋‹ˆ ์•ˆ์‹ฌํ•˜๊ณ  ์‚ฌ์šฉํ•˜์‹œ๊ธฐ ๋ฐ”๋ž๋‹ˆ๋‹ค.

> ์ €์ž‘๊ถŒ ๋ฌธ์ œ๊ฐ€ ์—†๋Š” ๊ณ ํ€„๋ฆฌํ‹ฐ์˜ ๋…ธ๋ž˜๋ฅผ ์ดํ›„์—๋„ ๊ณ„์†ํ•ด์„œ ํ›ˆ๋ จํ•  ์˜ˆ์ •์ž…๋‹ˆ๋‹ค.

## ์†Œ๊ฐœ

๋ณธ Repo๋Š” ๋‹ค์Œ๊ณผ ๊ฐ™์€ ํŠน์ง•์„ ๊ฐ€์ง€๊ณ  ์žˆ์Šต๋‹ˆ๋‹ค:

- top1 ๊ฒ€์ƒ‰์„ ์ด์šฉํ•˜์—ฌ ์ž…๋ ฅ ์Œ์ƒ‰ ํŠน์ง•์„ ํ›ˆ๋ จ ์„ธํŠธ ์Œ์ƒ‰ ํŠน์ง•์œผ๋กœ ๋Œ€์ฒดํ•˜์—ฌ ์Œ์ƒ‰์˜ ๋ˆ„์ถœ์„ ๋ฐฉ์ง€;
- ์ƒ๋Œ€์ ์œผ๋กœ ๋‚ฎ์€ ์„ฑ๋Šฅ์˜ GPU์—์„œ๋„ ๋น ๋ฅธ ํ›ˆ๋ จ ๊ฐ€๋Šฅ;
- ์ ์€ ์–‘์˜ ๋ฐ์ดํ„ฐ๋กœ ํ›ˆ๋ จํ•ด๋„ ์ข‹์€ ๊ฒฐ๊ณผ๋ฅผ ์–ป์„ ์ˆ˜ ์žˆ์Œ (์ตœ์†Œ 10๋ถ„ ์ด์ƒ์˜ ์ €์žก์Œ ์Œ์„ฑ ๋ฐ์ดํ„ฐ๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒƒ์„ ๊ถŒ์žฅ);
- ๋ชจ๋ธ ์œตํ•ฉ์„ ํ†ตํ•œ ์Œ์ƒ‰์˜ ๋ณ€์กฐ ๊ฐ€๋Šฅ (ckpt ์ฒ˜๋ฆฌ ํƒญ->ckpt ๋ณ‘ํ•ฉ ์„ ํƒ);
- ์‚ฌ์šฉํ•˜๊ธฐ ์‰ฌ์šด WebUI (์›น ์ธํ„ฐํŽ˜์ด์Šค);
- UVR5 ๋ชจ๋ธ์„ ์ด์šฉํ•˜์—ฌ ๋ชฉ์†Œ๋ฆฌ์™€ ๋ฐฐ๊ฒฝ์Œ์•…์˜ ๋น ๋ฅธ ๋ถ„๋ฆฌ;

## ํ™˜๊ฒฝ์˜ ์ค€๋น„

poetry๋ฅผ ํ†ตํ•ด dependecies๋ฅผ ์„ค์น˜ํ•˜๋Š” ๊ฒƒ์„ ๊ถŒ์žฅํ•ฉ๋‹ˆ๋‹ค.

๋‹ค์Œ ๋ช…๋ น์€ Python ๋ฒ„์ „ 3.8 ์ด์ƒ์˜ ํ™˜๊ฒฝ์—์„œ ์‹คํ–‰๋˜์–ด์•ผ ํ•ฉ๋‹ˆ๋‹ค:

```bash
# PyTorch ๊ด€๋ จ ์ฃผ์š” dependencies ์„ค์น˜, ์ด๋ฏธ ์„ค์น˜๋˜์–ด ์žˆ๋Š” ๊ฒฝ์šฐ ๊ฑด๋„ˆ๋›ฐ๊ธฐ ๊ฐ€๋Šฅ
# ์ฐธ์กฐ: https://pytorch.org/get-started/locally/
pip install torch torchvision torchaudio

# Windows + Nvidia Ampere Architecture(RTX30xx)๋ฅผ ์‚ฌ์šฉํ•˜๊ณ  ์žˆ๋‹ค๋ฉด, https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/issues/21 ์—์„œ ๋ช…์‹œ๋œ ๊ฒƒ๊ณผ ๊ฐ™์ด PyTorch์— ๋งž๋Š” CUDA ๋ฒ„์ „์„ ์ง€์ •ํ•ด์•ผ ํ•ฉ๋‹ˆ๋‹ค.
#pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu117

# Poetry ์„ค์น˜, ์ด๋ฏธ ์„ค์น˜๋˜์–ด ์žˆ๋Š” ๊ฒฝ์šฐ ๊ฑด๋„ˆ๋›ฐ๊ธฐ ๊ฐ€๋Šฅ
# Reference: https://python-poetry.org/docs/#installation
curl -sSL https://install.python-poetry.org | python3 -

# Dependecies ์„ค์น˜
poetry install
```

pip๋ฅผ ํ™œ์šฉํ•˜์—ฌ dependencies๋ฅผ ์„ค์น˜ํ•˜์—ฌ๋„ ๋ฌด๋ฐฉํ•ฉ๋‹ˆ๋‹ค.

```bash
pip install -r requirements.txt
```

## ๊ธฐํƒ€ ์‚ฌ์ „ ๋ชจ๋ธ ์ค€๋น„

RVC ๋ชจ๋ธ์€ ์ถ”๋ก ๊ณผ ํ›ˆ๋ จ์„ ์œ„ํ•˜์—ฌ ๋‹ค๋ฅธ ์‚ฌ์ „ ๋ชจ๋ธ์ด ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

[Huggingface space](https://huggingface.co/lj1995/VoiceConversionWebUI/tree/main/)๋ฅผ ํ†ตํ•ด์„œ ๋‹ค์šด๋กœ๋“œ ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

๋‹ค์Œ์€ RVC์— ํ•„์š”ํ•œ ์‚ฌ์ „ ๋ชจ๋ธ ๋ฐ ๊ธฐํƒ€ ํŒŒ์ผ ๋ชฉ๋ก์ž…๋‹ˆ๋‹ค:

```bash
./assets/hubert/hubert_base.pt

./assets/pretrained 

./assets/uvr5_weights

V2 ๋ฒ„์ „ ๋ชจ๋ธ์„ ํ…Œ์ŠคํŠธํ•˜๋ ค๋ฉด ์ถ”๊ฐ€ ๋‹ค์šด๋กœ๋“œ๊ฐ€ ํ•„์š”ํ•ฉ๋‹ˆ๋‹ค.

./assets/pretrained_v2

# Windows๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ ์ด ์‚ฌ์ „๋„ ํ•„์š”ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค. FFmpeg๊ฐ€ ์„ค์น˜๋˜์–ด ์žˆ์œผ๋ฉด ๊ฑด๋„ˆ๋›ฐ์–ด๋„ ๋ฉ๋‹ˆ๋‹ค.
ffmpeg.exe
```

๊ทธ ํ›„ ์ดํ•˜์˜ ๋ช…๋ น์„ ์‚ฌ์šฉํ•˜์—ฌ WebUI๋ฅผ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค:

```bash
python infer-web.py
```

Windows๋ฅผ ์‚ฌ์šฉํ•˜๋Š” ๊ฒฝ์šฐ `RVC-beta.7z`๋ฅผ ๋‹ค์šด๋กœ๋“œ ๋ฐ ์••์ถ• ํ•ด์ œํ•˜์—ฌ RVC๋ฅผ ์ง์ ‘ ์‚ฌ์šฉํ•˜๊ฑฐ๋‚˜ `go-web.bat`์„ ์‚ฌ์šฉํ•˜์—ฌ WebUi๋ฅผ ์‹œ์ž‘ํ•  ์ˆ˜ ์žˆ์Šต๋‹ˆ๋‹ค.

## ์ฐธ๊ณ 

- [ContentVec](https://github.com/auspicious3000/contentvec/)
- [VITS](https://github.com/jaywalnut310/vits)
- [HIFIGAN](https://github.com/jik876/hifi-gan)
- [Gradio](https://github.com/gradio-app/gradio)
- [FFmpeg](https://github.com/FFmpeg/FFmpeg)
- [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui)
- [audio-slicer](https://github.com/openvpi/audio-slicer)

## ๋ชจ๋“  ๊ธฐ์—ฌ์ž ๋ถ„๋“ค์˜ ๋…ธ๋ ฅ์— ๊ฐ์‚ฌ๋“œ๋ฆฝ๋‹ˆ๋‹ค.

<a href="https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank">
  <img src="https://contrib.rocks/image?repo=RVC-Project/Retrieval-based-Voice-Conversion-WebUI" />
</a>