Locutusque/TinyMistral-248M-v2.5

Text Generation

computer science

text-generation-inference

Inference Endpoints


Locutusque committed on Jan 24, 2024

Commit e512a9d (verified) · 1 Parent(s): bc0636f

Update README.md

Files changed (1)

README.md +1 -1


@@ -20,7 +20,7 @@ inference:
 # TinyMistral-248M-v2.5
 This model was created by merging TinyMistral-248M-v1 and v2, then further pretraining on synthetic textbooks. The resulting model's performance is superior to both, after personal evaluation.
-During training, this model reached an average perplexity score of 4, outperforming V1 by nearly 7x, and V2 by almost 4x.
+During training, this model reached an average perplexity score of 4, outperforming V1 by nearly 7x, and V2 by 4x.
 You can use the following config to reproduce the merged model: