---
license: mit
language:
- ja
metrics:
- accuracy
pipeline_tag: audio-to-audio
tags:
- rvc
---
# <center> RVC Genshin Impact Japanese Voice Model<br />
![model-cover.png](https://huggingface.co/ArkanDash/rvc-genshin-impact/resolve/main/model-cover.png)
# About Retrieval-based Voice Conversion (RVC)
Learn more about Retrieval-based Voice Conversion at the link below:<br />
[RVC WebUI](https://github.com/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)
# How to use?
Download the pre-zipped model and place it in your RVC project.
Model test: [Google Colab](https://colab.research.google.com/drive/110kiMZTdP6Ri1lY9-NbQf17GVPPhHyeT?usp=sharing) / [RVC Models New](https://huggingface.co/spaces/ArkanDash/rvc-models-new) (essentially the same demo, hosted on Hugging Face Spaces)
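
The download step can be scripted. The snippet below is a minimal sketch, not an official installer: it unpacks one downloaded archive into its own subfolder. The function name `install_model` and the folder layout are assumptions for illustration; check your RVC fork's documentation for where it actually expects model and index files.

```python
import zipfile
from pathlib import Path


def install_model(zip_path, weights_dir):
    """Extract a downloaded model archive into its own subfolder of weights_dir.

    Both arguments are hypothetical paths chosen by the caller.
    Returns the sorted relative paths of the files found after extraction.
    """
    zip_path = Path(zip_path)
    # Use the archive name (without .zip) as the subfolder name.
    target = Path(weights_dir) / zip_path.stem
    target.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(target)
    return sorted(
        p.relative_to(target).as_posix() for p in target.rglob("*") if p.is_file()
    )
```

Calling `install_model("some-model.zip", "weights")` would unpack the archive into `weights/some-model/` and return the list of extracted files, which makes it easy to confirm that both the model file and its index file are present.
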
## <center> INFO <br />
Model Created by ArkanDash <br />
The voices used in these models belong to HoYoverse.
The voice data used to train these models was ripped from the game (version 3.7). <br />
Total Models: 34 Models (19 V1 Models & 15 V2 Models)
Plans: <br />
- Nahida V2 RVC
- Zhongli V2 RVC
Replaced:
- The Raiden Shogun model has been retrained on a newer dataset because the older model's voice quality was poor. The old model has been deleted.
### V1 Model <br />
These models were trained with the original RVC.<br />
Pitch extraction: Harvest.<br />
Each model was trained for 100 epochs with a batch size of 10 and a 40k sample rate (some models use a 48k sample rate).<br />
Every V1 model was trained on roughly 30 minutes of character voice.
Some models are exceptions and were trained for more epochs because less voice data was available for the character:<br />
- Klee 150 Epochs
- Fischl 150 Epochs
### (New) V2 Model <br />
These models were trained with Mangio-Fork RVC.<br />
Pitch extraction: Crepe.<br />
Each model was trained for 100 epochs with a batch size of 8 and a 48k sample rate (some models use a 40k sample rate).<br />
Every V2 model was trained on roughly 60 minutes of character voice.
Other requests:<br />
- Greater Lord Rukkhadevata: 750 epochs, batch size 16, 48k sample rate (10-minute dataset)
- Charlotte: 400 epochs, batch size 16, 48k sample rate (18-minute dataset)
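
For quick reference, the V1 and V2 settings described in this card can be written down as data. This is only an illustrative sketch: the names `TrainingSettings`, `OVERRIDES`, and `settings_for` are hypothetical and are not part of RVC. The card does not say which models use the alternate sample rate or how much audio the Klee and Fischl datasets contain, so those details are left out or set to `None`.

```python
from dataclasses import dataclass, replace
from typing import Optional


@dataclass(frozen=True)
class TrainingSettings:
    trainer: str
    pitch_extractor: str
    epochs: int
    batch_size: int
    sample_rate: str
    dataset_minutes: Optional[int]  # approximate; None when the card gives no figure


# Version defaults as stated in this card (some models use the other sample rate).
V1 = TrainingSettings("Original RVC", "harvest", 100, 10, "40k", 30)
V2 = TrainingSettings("Mangio-Fork RVC", "crepe", 100, 8, "48k", 60)

# Per-model exceptions listed in this card, keyed by (character, version).
# Assumption: fields the card does not mention are inherited from the version default.
OVERRIDES = {
    ("Klee", "v1"): replace(V1, epochs=150, dataset_minutes=None),
    ("Fischl", "v1"): replace(V1, epochs=150, dataset_minutes=None),
    ("Greater Lord Rukkhadevata", "v2"): replace(
        V2, epochs=750, batch_size=16, dataset_minutes=10
    ),
    ("Charlotte", "v2"): replace(V2, epochs=400, batch_size=16, dataset_minutes=18),
}


def settings_for(character, version):
    """Return the listed settings for a model, falling back to the version default."""
    default = V1 if version == "v1" else V2
    return OVERRIDES.get((character, version), default)
```

For example, `settings_for("Charlotte", "v2")` returns the 400-epoch, batch-size-16 entry, while any character without a listed exception falls back to the V1 or V2 default.
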
Note:
- For Faruzan, the index file is smaller than usual for some reason. Training printed this log line: <br />
`Converged (lack of improvement in inertia) at step 1152/48215` <br />
I might retrain Faruzan soon.