TheDrummer
commited on
Commit
•
a6045dd
1
Parent(s):
efe61d9
Update README.md
Browse files
README.md
CHANGED
@@ -11,9 +11,7 @@ tags:
|
|
11 |
|
12 |
After conducting some quantitative testing, it turns out the model does have issues (it scored unusually high in perplexity).
|
13 |
|
14 |
-
I recommend
|
15 |
-
|
16 |
-
I have a feeling that the huge dip in training loss was the point where it broke. I'll recover the checkpoints for epoch 1 & epoch 2 and see if I can make a good v2.1 out of them.
|
17 |
|
18 |
# Moistral 11B v2 💦💦
|
19 |
|
|
|
11 |
|
12 |
After conducting some quantitative testing, it turns out the model does have issues (it scored unusually high in perplexity).
|
13 |
|
14 |
+
I recommend https://huggingface.co/TheDrummer/Moistral-11B-v2.1a-WET or if that's still problematic then https://huggingface.co/TheDrummer/Moistral-11B-v2.1b-SOGGY
|
|
|
|
|
15 |
|
16 |
# Moistral 11B v2 💦💦
|
17 |
|