ThenMagician
committed on
Commit
•
c6f8e58
1
Parent(s):
b869c86
Upload README.md
README.md
CHANGED
@@ -1,3 +1,11 @@
+---
+license: cc-by-nc-4.0
+tags:
+- not-for-all-audiences
+- roleplay
+- merge
+- nsfw
+---
 # Arconte-13B

 Arconte is a Llama-2 merge. Arconte has gone through many iterations, trying different recipes, models, and merge methods; in particular, iterations I and Z both showed promise. This version of Arconte is variation I redone with a more experimental approach to the merge recipe, and it shows great results.
@@ -15,3 +23,9 @@ Models used:
 After completing model C, the current roadmap is to either go into Mistral merges or try my hand at making LoRAs/QLoRAs. No Mixtral, nor anything above 13B parameters, in the future due to hardware limitations.

 All testing was done with a Q5_K_M GGUF. I'll upload the full GGUF range along with an imatrix version soon.
+
+# Update 3/30/24
+
+I have tested this model further and concluded that I find it boring. I remember I greenlighted this model because it was coherent (as much as a Q5_K_M can be), but now I think it's just not that good. But perhaps it is just my taste in models? Or maybe my sampling settings are bad? I would like some feedback to know how good or bad this model is. I still plan to cook that C model, but I don't know if I will use this one to do it.
+
+I will be releasing another model soon, an older model that I think is better than this one.