|
--- |
|
license: openrail |
|
--- |
|
Youtube: smotto_ai |
|
Tiktok: smotto_ai |
|
|
|
RVC v2 model |
|
|
|
Don't forget to credit me @Smotto if you do use this model! |
|
|
|
Data |
|
- English Talking style taken from her live podcast. Cut and clean them manually, but didn't bother keeping clips with any background noise (too lazy to edit them out between words). |
|
- 61.6 MB of data == 5 min and 36 seconds |
|
- 48khz 16bit-depth audio files |
|
|
|
Processing |
|
- Split audio clips using whisperX |
|
- Kim vocal 1 -> Reverb HQ -> Karaoke 2 (if * needed) -> DeEcho -> Denoise |
|
|
|
Hyper-parameters |
|
- mangio-crepe |
|
- 6 batch size |
|
- 16 pitch extraction hop-length |
|
- 300 epochs |