mav23 commited on
Commit
203e323
1 Parent(s): 00f5e42

Upload folder using huggingface_hub

Browse files
Files changed (3) hide show
  1. .gitattributes +1 -0
  2. README.md +52 -0
  3. lzlv_70b_fp16_hf.Q4_0.gguf +3 -0
.gitattributes CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ lzlv_70b_fp16_hf.Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-2.0
3
+ ---
4
+
5
+
6
+ # lzlv_70B
7
+ ## A Mythomax/MLewd_13B-style merge of selected 70B models
8
+
9
+ A multi-model merge of several LLaMA2 70B finetunes for roleplaying and creative work. The goal was to create a model that combines creativity with intelligence for an enhanced experience.
10
+
11
+ Did it work? Probably, maybe. It seemed subjectively better than each of the individual models in my tests.
12
+
13
+
14
+
15
+ ~~GGUF 4_K_M + 5_K_M can be found here: https://huggingface.co/lizpreciatior/lzlv_70b_fp16_hf/settings~~
16
+
17
+ Update 29/10:
18
+ Thank you to TheBloke for making the whole range of quants for lzlv: https://huggingface.co/TheBloke/lzlv_70B-GGUF
19
+
20
+ Also recommended: lzlv merged with limarpv3 - check it out here: https://huggingface.co/Doctor-Shotgun/lzlv-limarpv3-l2-70b/tree/main
21
+ Thanks for merging the LoRA. I think it gives the model a bit more creative spice.
22
+
23
+ lzlvV2 is in the works. Soon(tm).
24
+
25
+
26
+ ## Procedure:
27
+
28
+ Models used:
29
+ - **NousResearch/Nous-Hermes-Llama2-70b** - A great model for roleplaying, but not the best at following complex instructions.
30
+ - **Xwin-LM/Xwin-LM-7B-V0.1** - Excellent at following instructions and quite creative out of the box, so it seemed like the best available model to act as the base for the merge.
31
+ - **Doctor-Shotgun/Mythospice-70b** - The wildcard of the three. I was looking for a creative, NSFW-oriented model and came across this while digging through hf. I hadn't heard of it before and apparently no one had bothered to release a quantized version of this model. So I downloaded it and did it myself to test it. It turned out to be more or less what I was looking for as my third component, so I used it here.
32
+
33
+ A big thank you to the creators of the models above. If you look up Mythospice, you will notice that it also includes Nous-Hermes so it's technically present twice in this mix. This is apparently common practice amongst the cool kids who do 13B models so I don't think this hurts the model.
34
+
35
+
36
+ The merging process was heavily inspired by Undi95's approach in Undi95/MXLewdMini-L2-13B. To be specific, the ratios are:
37
+
38
+ Component 1: Merge of Mythospice x Xwin with SLERP gradient [0.25, 0.3, 0.5].
39
+ Component 2: Merge Xwin x Hermes with SLERP gradient [0.4, 0.3, 0.25].
40
+
41
+ Finally, both Component 1 and Component 2 were merged with SLERP using weight 0.5.
42
+
43
+ ## Peformance
44
+
45
+ I tested this model for a few days before publishing it. It seems to more or less retain the instruction-following capabilities of Xwin-70B, while seeming to have adopted a lot of the creativity of the other two models.
46
+ It handled my more complex scenarios that creative models otherwise tend to struggle with quite well. At the same time, its outputs felt more creative and possibly a bit more nsfw-inclined than Xwin-70b.
47
+ So, is it better? Feels like it to me, subjectively. Is it really better? No clue, test it.
48
+
49
+ ## Prompt format:
50
+ Vicuna
51
+ USER: [Prompt]
52
+ ASSISTANT:
lzlv_70b_fp16_hf.Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a57bceba29af3ff4f162a7a3539c2ab613b24c1a873e07a61d0749e08147f80a
3
+ size 38872249504