AuriAetherwiing committed
Commit 419f078
1 Parent(s): 55ddaef

Update README.md

Files changed (1): README.md (+43, -3)
README.md CHANGED
@@ -6,11 +6,51 @@ library_name: transformers
  tags:
  - mergekit
  - merge
 
  ---
- # merge-aletheia-7
 
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
  ## Merge Details
  ### Merge Method
@@ -40,4 +80,4 @@ slices:
  model: allura-org/TQ2.5-14B-Neon-v1
  - layer_range: [0, 48]
  model: allura-org/TQ2.5-14B-Sugarquill-v1
- ```
 
  tags:
  - mergekit
  - merge
+ license: apache-2.0
+ language:
+ - en
+ ---
+ # Qwen2.5-14B Aletheia v1
+
+ RP/Story hybrid model, a merge of Sugarquill and Neon. As with the Gemma version, I wanted to preserve Sugarquill's creative spark while making the model more steerable for RP. It proved more difficult this time, but I quite like the result regardless, even if the model is still somewhat temperamental.
+
+ Should work for both RP and storywriting, either on raw completion or with back-and-forth co-writing in chat mode. Seems to be quite sensitive to low-depth instructions and samplers.
+
+ Thanks to Toasty and Fizz for testing and giving feedback.
+
+ Model was created by Auri.
 
  ---
 
+ **Notes about merging**
+
+ It took me twenty-something attempts to make this model. TIES didn't work at all, producing broken or nearly broken results every time. SLERP worked much better, and after just three attempts I got something I like.
+ Sugarquill was really prone to overtaking the merge, so I had to reduce its share a lot, and the model still carries a lot of its influence.
+
+ **Format**
+
+ The model responds to ChatML instruct formatting, exactly like its base model.
+
+ ```
+ <|im_start|>system
+ {system message}<|im_end|>
+ <|im_start|>user
+ {user message}<|im_end|>
+ <|im_start|>assistant
+ {response}<|im_end|>
+ ```
+
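For raw-completion backends, the template above has to be assembled by hand. A minimal sketch in plain Python (the helper name and example messages are illustrative, not part of the model card):

```python
# Build a single-turn ChatML prompt matching the template above,
# leaving the assistant turn open for the model to complete.
def build_chatml_prompt(system: str, user: str) -> str:
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        "<|im_start|>assistant\n"
    )

prompt = build_chatml_prompt("You are a co-writer.", "Continue the story.")
```

In chat mode, frontends that support a ChatML preset will produce the same string automatically.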
+ **Recommended Samplers**
+
+ This one is a bit of a special snowflake, with particular tastes. These settings seem to work pretty well:
+
+ ```
+ Temperature - 0.8
+ Top-A - 0.3
+ TFS - 0.75
+ DRY - Multiplier 0.8 - Base 1.75 - Allowed length 3 - Range 1024
+ ```
+
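The same settings can be passed to a backend as a parameter dict. The key names below follow text-generation-webui-style conventions and are an assumption; other backends may spell them differently, so check your backend's documentation:

```python
# The recommended sampler settings above, as a parameter dict.
# Key names are an assumption (text-generation-webui style),
# not taken from the model card.
sampler_settings = {
    "temperature": 0.8,
    "top_a": 0.3,
    "tfs": 0.75,
    "dry_multiplier": 0.8,
    "dry_base": 1.75,
    "dry_allowed_length": 3,
    "dry_range": 1024,
}
```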
 
  ## Merge Details
  ### Merge Method

  model: allura-org/TQ2.5-14B-Neon-v1
  - layer_range: [0, 48]
  model: allura-org/TQ2.5-14B-Sugarquill-v1
+ ```