Locutusque
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -20,7 +20,7 @@ inference:
|
|
20 |
# TinyMistral-248M-v2.5
|
21 |
This model was created by merging TinyMistral-248M-v1 and v2, then further pretraining on synthetic textbooks. The resulting model's performance is superior to both, after personal evaluation.
|
22 |
|
23 |
-
During training, this model reached an average perplexity score of 4, outperforming V1 by nearly 7x, and V2 by
|
24 |
|
25 |
You can use the following config to reproduce the merged model:
|
26 |
|
|
|
20 |
# TinyMistral-248M-v2.5
|
21 |
This model was created by merging TinyMistral-248M-v1 and v2, then further pretraining on synthetic textbooks. The resulting model's performance is superior to both, after personal evaluation.
|
22 |
|
23 |
+
During training, this model reached an average perplexity score of 4, outperforming V1 by nearly 7x, and V2 by 4x.
|
24 |
|
25 |
You can use the following config to reproduce the merged model:
|
26 |
|