r3gm commited on
Commit
1173beb
โ€ข
1 Parent(s): 6083458

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -222
README.md CHANGED
@@ -1,222 +1,10 @@
1
- # ๐Ÿ Applio-RVC-Fork
2
- Applio is a user-friendly fork of Mangio-RVC-Fork/RVC, designed to provide an intuitive interface, especially for newcomers.
3
-
4
- ## ๐Ÿ“Ž Links
5
- [![Discord](https://img.shields.io/badge/SUPPORT_DISCORD-37a779?style=for-the-badge)](https://discord.gg/IAHispano)
6
- [![Google Colab](https://img.shields.io/badge/GOOGLE_COLAB-37a779?style=for-the-badge)](https://colab.research.google.com/drive/157pUQep6txJOYModYFqvz_5OJajeh7Ii)
7
-
8
- ## ๐Ÿ“š Table of Contents
9
- 1. [Improvements of Applio Over RVC](#-improvements-of-applio-over-rvc)
10
- 2. [Additional Features of This Repository](#๏ธ-additional-features-of-this-repository)
11
- 3. [Planned Features for Future Development](#๏ธ-planned-features-for-future-development)
12
- 4. [Installation](#-installation)
13
- 5. [Running the Web GUI (Inference & Train)](#-running-the-web-gui-inference--train)
14
- 6. [Running the CLI (Inference & Train)](#-running-the-cli-inference--train)
15
- 7. [Credits](#credits)
16
- 8. [Thanks to all RVC and Mangio contributors](#thanks-to-all-rvc-and-mangio-contributors)
17
-
18
-
19
- ## ๐ŸŽฏ Improvements of Applio Over RVC
20
- ### f0 Inference Algorithm Overhaul
21
- - Applio features a comprehensive overhaul of the f0 inference algorithm, including:
22
- - Addition of the pyworld dio f0 method.
23
- - Alternative method for calculating crepe f0.
24
- - Introduction of the torchcrepe crepe-tiny model.
25
- - Customizable crepe_hop_length for the crepe algorithm via both the web GUI and CLI.
26
-
27
- ### f0 Crepe Pitch Extraction for Training
28
- - Works on paperspace machines but not local MacOS/Windows machines (Potential memory leak).
29
-
30
- ### Paperspace Integration
31
- - Applio seamlessly integrates with Paperspace, providing the following features:
32
- - Paperspace argument on infer-web.py (--paperspace) for sharing a Gradio link.
33
- - A dedicated make file tailored for Paperspace users.
34
-
35
- ### Access to Tensorboard
36
- - Applio grants easy access to Tensorboard via a Makefile and a Python script.
37
-
38
- ### CLI Functionality
39
- - Applio introduces command-line interface (CLI) functionality, with the addition of the --is_cli flag in infer-web.py for CLI system usage.
40
-
41
- ### f0 Hybrid Estimation Method
42
- - Applio offers a novel f0 hybrid estimation method by calculating nanmedian for a specified array of f0 methods, ensuring the best results from multiple methods (CLI exclusive).
43
- - This hybrid estimation method is also available for f0 feature extraction during training.
44
-
45
- ### UI Changes
46
- #### Inference:
47
- - A complete interface redesign enhances user experience, with notable features such as:
48
- - Audio recording directly from the interface.
49
- - Convenient drop-down menus for audio and .index file selection.
50
- - An advanced settings section with new features like autotune and formant shifting.
51
-
52
- #### Training:
53
- - Improved training features include:
54
- - A total epoch slider now limited to 10,000.
55
- - Increased save frequency limit to 100.
56
- - Default recommended options for smoother setup.
57
- - Better adaptation to high-resolution screens.
58
- - A drop-down menu for dataset selection.
59
- - Enhanced saving system options, including Save all files, Save G and D files, and Save model for inference.
60
-
61
- #### UVR:
62
- - Applio ensures compatibility with all VR/MDX models for an extended range of possibilities.
63
-
64
- #### TTS (Text-to-Speech, New):
65
- - Introducing a new Text-to-Speech (TTS) feature using RVC models.
66
- - Support for multiple languages and Edge-tts/Bark-tts.
67
-
68
- #### Resources (New):
69
- - Users can now upload models, backups, datasets, and audios from various storage services like Drive, Huggingface, Discord, and more.
70
- - Download audios from YouTube with the ability to automatically separate instrumental and vocals, offering advanced options and UVR support.
71
-
72
- #### Extra (New):
73
- - Combine instrumental and vocals with ease, including independent volume control for each track and the option to add effects like reverb, compressor, and noise gate.
74
- - Significant improvements in the processing interface, allowing tasks such as merging models, modifying information, obtaining information, or extracting models effortlessly.
75
-
76
- ## โš™๏ธ Additional Features of This Repository
77
-
78
- In addition to the aforementioned improvements, this repository offers the following features:
79
-
80
- ### Enhanced Tone Leakage Reduction
81
- - Implements tone leakage reduction by replacing source features with training-set features using top1 retrieval. This helps in achieving cleaner audio results.
82
-
83
- ### Efficient Training
84
- - Provides a seamless and speedy training experience, even on relatively modest graphics cards. The system is optimized for efficient resource utilization.
85
-
86
- ### Data Efficiency
87
- - Supports training with a small dataset, yielding commendable results, especially with audio clips of at least 10 minutes of low-noise speech.
88
-
89
- ## ๐Ÿ› ๏ธ Planned Features for Future Development
90
- As part of the ongoing development of this fork, the following features are planned to be added:
91
-
92
- - Incorporating an inference batcher script based on user feedback. This enhancement will allow for processing 30-second audio samples at a time, improving output quality and preventing memory errors during inference.
93
- - Implementing an automatic removal mechanism for old generations to optimize storage space usage. This feature ensures that the repository remains efficient and organized over time.
94
- - Streamlining the training process for Paperspace machines to further improve efficiency and resource utilization during training tasks.
95
-
96
- ## Compatibility
97
- - AMD/Intel graphics cards acceleration supported.
98
- - Intel ARC graphics cards acceleration with IPEX supported.
99
-
100
- ## โœจ Installation
101
-
102
- ### Automatic installation (Windows):
103
- To quickly and effortlessly install Applio along with all the necessary models and configurations on Windows, you can use the [install_Applio.bat](https://github.com/IAHispano/Applio-RVC-Fork/releases) script available in the releases section.
104
-
105
- ### Manual installation (Windows/MacOS):
106
- **Note for MacOS Users**: When using `faiss 1.7.2` under MacOS, you may encounter a Segmentation Fault: 11 error. To resolve this issue, install `faiss-cpu 1.7.0` using the following command if you're installing it manually with pip:
107
- ```bash
108
- pip install faiss-cpu==1.7.0
109
- ```
110
- Additionally, you can install Swig on MacOS using brew:
111
- ```bash
112
- brew install swig
113
- ```
114
-
115
- Install requirements:
116
- *Using pip (Python 3.9.8 is stable with this fork)*
117
- ```bash
118
- pip install -r requirements.txt
119
- ```
120
-
121
- ### Manual installation (Paperspace):
122
- ```bash
123
- cd Applio-RVC-Fork
124
- make install # Do this everytime you start your paperspace machine
125
- ```
126
- ### You can also use pip to install them:
127
- ```bash
128
-
129
- for Nvidia graphics cards
130
- pip install -r requirements.txt
131
-
132
- for AMD/Intel graphics cards๏ผš
133
- pip install -r requirements-dml.txt
134
-
135
- for Intel ARC graphics cards on Linux / WSL using Python 3.10:
136
- pip install -r requirements-ipex.txt
137
-
138
- ```
139
-
140
- ## ๐Ÿช„ Running the Web GUI (Inference & Train)
141
- *Use --paperspace or --colab if on cloud system.*
142
- ```bash
143
- python infer-web.py --pycmd python --port 3000
144
- ```
145
-
146
- ## ๐Ÿ’ป Running the CLI (Inference & Train)
147
- ```bash
148
- python infer-web.py --pycmd python --is_cli
149
- ```
150
-
151
- ```bash
152
- Mangio-RVC-Fork v2 CLI App!
153
-
154
- Welcome to the CLI version of RVC. Please read the documentation on https://github.com/Mangio621/Mangio-RVC-Fork (README.MD) to understand how to use this app.
155
-
156
- You are currently in 'HOME':
157
- go home : Takes you back to home with a navigation list.
158
- go infer : Takes you to inference command execution.
159
-
160
- go pre-process : Takes you to training step.1) pre-process command execution.
161
- go extract-feature : Takes you to training step.2) extract-feature command execution.
162
- go train : Takes you to training step.3) being or continue training command execution.
163
- go train-feature : Takes you to the train feature index command execution.
164
-
165
- go extract-model : Takes you to the extract small model command execution.
166
-
167
- HOME:
168
- ```
169
-
170
- Typing 'go infer' for example will take you to the infer page where you can then enter in your arguments that you wish to use for that specific page. For example typing 'go infer' will take you here:
171
-
172
- ```bash
173
- HOME: go infer
174
- You are currently in 'INFER':
175
- arg 1) model name with .pth in ./weights: mi-test.pth
176
- arg 2) source audio path: myFolder\MySource.wav
177
- arg 3) output file name to be placed in './audio-outputs': MyTest.wav
178
- arg 4) feature index file path: logs/mi-test/added_IVF3042_Flat_nprobe_1.index
179
- arg 5) speaker id: 0
180
- arg 6) transposition: 0
181
- arg 7) f0 method: harvest (pm, harvest, crepe, crepe-tiny)
182
- arg 8) crepe hop length: 160
183
- arg 9) harvest median filter radius: 3 (0-7)
184
- arg 10) post resample rate: 0
185
- arg 11) mix volume envelope: 1
186
- arg 12) feature index ratio: 0.78 (0-1)
187
- arg 13) Voiceless Consonant Protection (Less Artifact): 0.33 (Smaller number = more protection. 0.50 means Dont Use.)
188
-
189
- Example: mi-test.pth saudio/Sidney.wav myTest.wav logs/mi-test/added_index.index 0 -2 harvest 160 3 0 1 0.95 0.33
190
-
191
- INFER: <INSERT ARGUMENTS HERE OR COPY AND PASTE THE EXAMPLE>
192
- ```
193
- ## ๐Ÿ† Credits
194
- Applio owes its existence to the collaborative efforts of various repositories, including Mangio-RVC-Fork, and all the other credited contributors. Without their contributions, Applio would not have been possible. Therefore, we kindly request that if you appreciate the work we've accomplished, you consider exploring the projects mentioned in our credits.
195
-
196
- Our goal is not to supplant RVC or Mangio; rather, we aim to provide a contemporary and up-to-date alternative for the entire community.
197
-
198
- + [Retrieval-based-Voice-Conversion-WebUI](Retrieval-based-Voice-Conversion-WebUI)
199
- + [Mangio-RVC-Fork](https://github.com/Mangio621/Mangio-RVC-Fork)
200
- + [RVG_tts](https://github.com/Foxify52/RVG_tts)
201
- + [ContentVec](https://github.com/auspicious3000/contentvec/)
202
- + [VITS](https://github.com/jaywalnut310/vits)
203
- + [HIFIGAN](https://github.com/jik876/hifi-gan)
204
- + [Gradio](https://github.com/gradio-app/gradio)
205
- + [FFmpeg](https://github.com/FFmpeg/FFmpeg)
206
- + [Ultimate Vocal Remover](https://github.com/Anjok07/ultimatevocalremovergui)
207
- + [audio-slicer](https://github.com/openvpi/audio-slicer)
208
- + [Vocal pitch extraction:RMVPE](https://github.com/Dream-High/RMVPE)
209
-
210
-
211
- ## ๐Ÿ™ Thanks to all RVC, Mangio and Applio contributors
212
- <a href="https://github.com/liujing04/Retrieval-based-Voice-Conversion-WebUI/graphs/contributors" target="_blank">
213
- <img src="https://contrib.rocks/image?repo=liujing04/Retrieval-based-Voice-Conversion-WebUI" />
214
- </a>
215
-
216
- <a href="https://github.com/Mangio621/Mangio-RVC-Fork/graphs/contributors" target="_blank">
217
- <img src="https://contrib.rocks/image?repo=Mangio621/Mangio-RVC-Fork" />
218
- </a>
219
-
220
- <a href="https://github.com/IAHispano/Applio-RVC-Fork/graphs/contributors" target="_blank">
221
- <img src="https://contrib.rocks/image?repo=IAHispano/Applio-RVC-Fork" />
222
- </a>
 
1
+ ---
2
+ title: Random App
3
+ emoji: ๐Ÿ‘€
4
+ colorFrom: gray
5
+ colorTo: pink
6
+ sdk: gradio
7
+ sdk_version: 3.43.2
8
+ app_file: app.py
9
+ pinned: false
10
+ ---