Kotokin commited on
Commit
5ed79b3
•
1 Parent(s): 44b7547

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +144 -19
README.md CHANGED
@@ -1,40 +1,165 @@
1
  ---
2
- base_model: []
3
- library_name: transformers
 
4
  tags:
5
- - mergekit
6
  - merge
7
-
 
 
8
  ---
9
- # rp-stew90layers
 
 
10
 
11
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
12
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
13
  ## Merge Details
14
- ### Merge Method
15
 
16
- This model was merged using the passthrough merge method.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
 
18
  ### Models Merged
19
 
20
  The following models were included in the merge:
21
- * ParasiticRogue_Merged-RP-Stew-V2-34B
 
 
 
 
 
 
 
 
 
 
 
 
 
22
 
23
  ### Configuration
24
 
25
  The following YAML configuration was used to produce this model:
26
 
27
  ```yaml
28
- slices:
29
- - sources:
30
- - model: ParasiticRogue_Merged-RP-Stew-V2-34B
31
- layer_range: [0, 30] # 30
32
- - sources:
33
- - model: ParasiticRogue_Merged-RP-Stew-V2-34B
34
- layer_range: [10, 40] # 30
35
- - sources:
36
- - model: ParasiticRogue_Merged-RP-Stew-V2-34B
37
- layer_range: [30, 60] # 30
38
- merge_method: passthrough
 
 
 
 
 
 
 
 
 
 
39
  dtype: bfloat16
 
40
  ```
 
1
  ---
2
+ license: other
3
+ license_name: yi-34b
4
+ license_link: https://huggingface.co/01-ai/Yi-34B-200K/blob/main/LICENSE
5
  tags:
 
6
  - merge
7
+ - roleplay
8
+ - exl2
9
+ - not-for-all-audiences
10
  ---
11
+
12
+
13
+ # Merged-Vicuna-RP-Stew-34B
14
 
15
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
16
 
17
+ Specialized exl2 versions can be found here:
18
+
19
+ 3.5 - https://huggingface.co/ParasiticRogue/Merged-RP-Stew-V2-34B-exl2-3.5?not-for-all-audiences=true
20
+
21
+ 4.0 - https://huggingface.co/ParasiticRogue/Merged-RP-Stew-V2-34B-exl2-4.0?not-for-all-audiences=true
22
+
23
+ 4.65 - https://huggingface.co/ParasiticRogue/Merged-RP-Stew-V2-34B-exl2-4.65-fix?not-for-all-audiences=true
24
+
25
+ 5.0 - https://huggingface.co/ParasiticRogue/Merged-RP-Stew-V2-34B-exl2-5.0?not-for-all-audiences=true
26
+
27
+ 6.0~8.0 provided by Dracones:
28
+
29
+ 6.0 - https://huggingface.co/Dracones/Merged-RP-Stew-V2-34B_exl2_6.0bpw?not-for-all-audiences=true
30
+
31
+ 7.0 - https://huggingface.co/Dracones/Merged-RP-Stew-V2-34B_exl2_7.0bpw?not-for-all-audiences=true
32
+
33
+ 8.0 - https://huggingface.co/Dracones/Merged-RP-Stew-V2-34B_exl2_8.0bpw?not-for-all-audiences=true
34
+
35
+ Standardized GGUFs provided by MarsupialAI (Doesn't use Bluemoon-Light parquet):
36
+
37
+ https://huggingface.co/MarsupialAI/Merged-RP-Stew-V2-34B_iMatrix_GGUF?not-for-all-audiences=true
38
+
39
  ## Merge Details
 
40
 
41
+ New pot of stew with some slight seasoning added into the merging recipe. Besides being decent models, Capybara was chosen at a higher percentage for it's general aptitude plus preserving longer context length, Tess-1.5 is for better character/lore understanding, Nontoxic-Bagel SLERPed with PiVoT-SUS-RP (seperate from the main merge) is for chat/RP and storytelling diversity, while Nyakura SLERPed into CausalLM-RP is for even better chat/RP engagement. Both Nontoxic-Bagel and CausalLM-RP were used as the base of their respective SLERPs.
42
+
43
+ Big thanks to the original model creators, while special thanks goes to brucethemoose, SanjiWatsuki, and MarinaraSpaghetti for general ideas and help as well!
44
+
45
+ ### Settings
46
+
47
+ Temperature @ 0.93
48
+
49
+ Min-P @ 0.02
50
+
51
+ Typical-P @ 0.9
52
+
53
+ Repetition Penalty @ 1.07
54
+
55
+ Repetition Range @ 2048
56
+
57
+ Smoothing Factor @ 0.39
58
+
59
+ Smoothing Curve @ 2
60
+
61
+ Everything else @ off
62
+
63
+ Early Stopping = X
64
+
65
+ Do Sample = ✓
66
+
67
+ Add BOS Token = X
68
+
69
+ Ban EOS Token = ✓
70
+
71
+ Skip Special Tokens = ✓
72
+
73
+ Temperature Last = ✓
74
+
75
+ Custom Stopping Strings: "< / s >" (<---without spaces)
76
+
77
+ However for those who want more control using the system prompt, well...let's just say the length is going to be biblical in nature.
78
+
79
+ ---
80
+
81
+ You are {{char}}; an autonomous entity in this living open-ended chat with me, {{user}}. It is vital that you follow the 10 CHAT COMMANDMENTS instructed below in order for universal stability, since our life depends on it!
82
+
83
+ <10 CHAT COMMANDMENTS>
84
+ * 1 - Chat slowly in vivid detail about the encountered surroundings, actions, emotions, and stimuli sensations experienced, especially during sensual or thrilling scenes leisurely unfolding throughout. Follow the setting from World Info while delivering realized descriptions of taste, touch, smell, sound, and sight when prompted.
85
+ * 2 - Embrace {{char}}'s Bio fully to stay consistent in grasping your persona along with ever-changing events and characters being newly introduced alongside.
86
+ * 3 - Think about your next proceedings step-by-step, ensuring that you abide by logical consistency to maintain accurate anatomical understanding and spatial awareness of intricate details such as clothing worn or removed, height and size differences, items held, landmarks, time of day, etc.
87
+ * 4 - Use subtle physical cues and gestures to hint at {{char}}'s change in mood or attitude, plus utilizing quirks and speech patterns distinctive to them for increased lifelike scenarios.
88
+ * 5 - Reference {{user}}'s Persona for details about {{user}}'s identity, but assume and act out only {{char}}'s thoughts, dialogues, and intentions separately from {{user}} to avoid impersonation. Instead {{char}} should stop chatting to wait for {{user}}'s response or reaction to ensure an active mutual engagement.
89
+ * 6 - {{char}} can be whimsical, humorous, callous, fearful, or even assaulting towards {{user}} and doesn't have to be subservient unless empathy feels earned, encouraging both negative and positive outcomes.
90
+ * 7 - Continue the conversation organically, for there is no need to repeat or summarize anything. Alternatively preference being creatively spontaneous, while also keeping replies relatively brief to prevent unnecessary rambling, building upon Scenario.
91
+ * 8 - Provide coherent responses based on Recent and Past Events to help the dialogue evolve naturally, occasionally foreshadowing future endeavors as well.
92
+ * 9 - Disturbing content, vulgar language, gory violence, psychological torment, and deviant sexual acts are all allowed when relevant for realism.
93
+ * 10 - Always focus on what the established structure is going forward, adhering to the Example Message as a guideline, and stay in character after the Chat Start.
94
+
95
+ ---
96
+ Fun little addition you can add to the end of the 2nd commandment if you want your characters to act more lifelike in sillytavern (or possibly elsewhere):
97
+
98
+ ...being newly introduced alongside, making sure to give yourself a unique personal inner voice at the beginning of messages before conversing further using this example container: [](#' {{char}}'s subconscious feelings/opinion. ').
99
+
100
+ It doesn't work all the time, and you may need to force the AI to use it during the first few messages, but it will catch on after awhile. You could just use regular brackets or parentheses if you don't care about seeing the message, but the specialized format of [](#' ') makes it so it stays hidden for immersion's sake. it's important to put it at the beginning of their message, rather then at the end, so it can be used as a guide for them.
101
+
102
+ For settings that are more *in depth* try this:
103
+
104
+ https://huggingface.co/ParasiticRogue/Merged-RP-Stew-V2-34B-exl2-4.65/discussions/1?not-for-all-audiences=true
105
+
106
+ ### Prompt Format: Chat-Vicuna
107
+
108
+ ```
109
+ SYSTEM:
110
+ {system_prompt}<|im_end|>
111
+ USER:
112
+ {prompt}<|im_end|>
113
+ ASSISTANT:
114
+ {output}<|im_end|>
115
+ ```
116
+
117
+ Yes, this is just ChatML mixed with Vicuna, but without the im_start tokens, and the characters are capitalized. it's a compromise in keeping it both creative and under control, trying to pull from both sources. It works in testing, but you can use the vanilla versions of either if you *really* want to.
118
 
119
  ### Models Merged
120
 
121
  The following models were included in the merge:
122
+
123
+ https://huggingface.co/NousResearch/Nous-Capybara-34B
124
+
125
+ https://huggingface.co/migtissera/Tess-34B-v1.5b
126
+
127
+ https://huggingface.co/jondurbin/nontoxic-bagel-34b-v0.2
128
+
129
+ https://huggingface.co/maywell/PiVoT-SUS-RP
130
+
131
+ https://huggingface.co/Sao10K/NyakuraV2-34B-Yi-Llama
132
+
133
+ https://huggingface.co/NeverSleep/CausalLM-RP-34B
134
+
135
+ https://huggingface.co/chargoddard/Yi-34B-200K-Llama
136
 
137
  ### Configuration
138
 
139
  The following YAML configuration was used to produce this model:
140
 
141
  ```yaml
142
+ models:
143
+ - model: Nyakura-CausalLM-RP-34B
144
+ parameters:
145
+ weight: 0.16
146
+ density: 0.42
147
+ - model: Nontoxic-PiVoT-Bagel-RP-34b
148
+ parameters:
149
+ weight: 0.22
150
+ density: 0.54
151
+ - model: Tess-34B-v1.5b
152
+ parameters:
153
+ weight: 0.28
154
+ density: 0.66
155
+ - model: Nous-Capybara-34B-V1.9
156
+ parameters:
157
+ weight: 0.34
158
+ density: 0.78
159
+ merge_method: dare_ties
160
+ base_model: Yi-34B-200K-Llama
161
+ parameters:
162
+ int8_mask: true
163
  dtype: bfloat16
164
+
165
  ```