kronosta commited on
Commit
cc3c0c0
·
verified ·
1 Parent(s): ac26539

Add description

Browse files
Files changed (1) hide show
  1. README.md +21 -3
README.md CHANGED
@@ -1,3 +1,21 @@
1
- ---
2
- license: mit
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ ---
4
+ These are RVC models for the custom voices I crafted for some of the characters in my fictional multiverse called the Dimensional Stack (or alternatively Quatrammotile).
5
+
6
+ Essentially I found a strategy where you can concatenate audio files of voices to mix them, and use XTTS_v2 to randomize the voices a little while keeping overall tonality
7
+ (because it's kinda bad at cloning tbh). After crafting fitting voices for my characters usable with CosyVoice, I generated about 4 minutes of output and fed it into the
8
+ RVC trainer. Note: Pitch-detection has been enabled so these voices can theoretically sing, that's not to say they're very good singers (they're not really, the voices are
9
+ too abrasive).
10
+
11
+ # Voice Descriptions
12
+ - Uncovesseltuxe
13
+ - Composed of a complex mixture of Karl Jobst and a brief snippet of Ccarretti. It's clear, mostly neutral, and a bit nerdy. When singing he turns into a harsh-voiced
14
+ country grandma for some reason, I'm not sure why (well, it's obvious that it's giving him the same singing voice as his talking voice, which is not how it normally
15
+ works, and it just so happens that his talking voice sings like that. But you wouldn't anticipate him singing that way based on his voice. Tangent over.)
16
+ - Ievokt
17
+ - Composed of a mixture between Jan Misali and Matt Rose, both pitched down 2 semitones before cloning. Note that the AI's interpretation of this mixture is nothing like
18
+ its components. Ievokt's voice is harsh, gravelly, and lends itself well to aggressive tones.
19
+ - Thaneophyros
20
+ - Thaneophyros' voice is literally just Geosquare with a few intermediate cloning steps that change it a tiny bit. But it's still mostly just Geosquare.
21
+ The voice is calm, warm, and low-pitched. I have not tested the singing on this model, but I think it might work better due to it being a much more simple voice.