ThenMagician commited on
Commit
c6f8e58
1 Parent(s): b869c86

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -1,3 +1,11 @@
 
 
 
 
 
 
 
 
1
  # Arconte-13B
2
 
3
  Arconte is Llama-2 merge. Arconte has many iterations, trying different recipes/models/merge-methods, in particular, iteration I and iteration Z are both models which showed promise. This version of Arconte is variation I redone with a more experimental approach to the merge recipe, and it shows great results.
@@ -15,3 +23,9 @@ Models used:
15
  After completing model C, current roadmap is to either go into mistral merges, or trying my hand at making loras/qloras. No mixtral, nor anything above 13B parameters in the future due to hardware limitations.
16
 
17
  All testing was done with Q5_K_M GUFF. I'll upload the full GUFF range along with an Imatrix version soon.
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ tags:
4
+ - not-for-all-audiences
5
+ - roleplay
6
+ - merge
7
+ - nsfw
8
+ ---
9
  # Arconte-13B
10
 
11
  Arconte is Llama-2 merge. Arconte has many iterations, trying different recipes/models/merge-methods, in particular, iteration I and iteration Z are both models which showed promise. This version of Arconte is variation I redone with a more experimental approach to the merge recipe, and it shows great results.
 
23
  After completing model C, current roadmap is to either go into mistral merges, or trying my hand at making loras/qloras. No mixtral, nor anything above 13B parameters in the future due to hardware limitations.
24
 
25
  All testing was done with Q5_K_M GUFF. I'll upload the full GUFF range along with an Imatrix version soon.
26
+
27
+ # Update 3/30/24
28
+
29
+ I have tested this model further and I concluded that I find it boring. I remember I greenlighted this model because it was coherent (as much as a Q5_K_M can be), but now I think it's just not that good. But perhaps it is just my taste in models? or maybe my sampling settings are bad? I would like some feedback to know how good or bad this model is. I still plan to cook that C model, but I don't know if I will use this one to do it.
30
+
31
+ I will be releasing another model soon, an older model that I think is better than this one.