SanjiWatsuki/Sonya-7B

SanjiWatsuki committed on Dec 31, 2023

Commit 9851677 • 1 Parent(s): ddf6c45

Update README.md

Files changed (1)

README.md +1 -1


@@ -29,7 +29,7 @@ I picked these models because:
 Based on the parent models, I expect this model to be used with an 8192 context window. Please use NTK scaling alpha of 2.6 to experimentally try out 16383 context.
-**Let me be candid:** Despite the test scores, I do not believe this model is a GPT killer. I think it's a very sharp model, it probably punches way above its weight, but it's still a 7B model. Even for a 7B model, I think it's quirky and has some weird outputs. Keep your expectations in check 😉
+**Let me be candid:** Despite the test scores, this model is **NOT is a GPT killer**. I think it's a very sharp model **for a 7B**, it probably punches way above its weight **for a 7B**, but it's still a 7B model. Even for a 7B model, I think **it's quirky and has some weird outputs**. Keep your expectations in check 😉
 **MT-Bench Average Turn**
 | model              | score     | size
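
The context line in this hunk recommends an NTK scaling alpha of 2.6 for longer context. As a rough illustration of what such an alpha typically does, here is a minimal sketch of one common formulation of NTK-aware RoPE scaling, where the rotary base is raised by `alpha ** (head_dim / (head_dim - 2))`. The README does not say which loader or formula is intended, so the formula, the default `base` of 10000, and the `head_dim` of 128 are all assumptions (typical for Mistral-7B-style models), and the function names are hypothetical.

```python
# Sketch of NTK-aware RoPE scaling (one common formulation, assumed here).
# base=10000 and head_dim=128 are assumptions, not values from the README.

def ntk_scaled_rope_base(alpha: float, base: float = 10000.0, head_dim: int = 128) -> float:
    """Return the rotary base after applying an NTK alpha factor."""
    return base * alpha ** (head_dim / (head_dim - 2))


def rope_inv_freq(base: float, head_dim: int = 128) -> list[float]:
    """Inverse rotary frequencies, one per pair of head dimensions."""
    return [base ** (-2 * i / head_dim) for i in range(head_dim // 2)]


if __name__ == "__main__":
    scaled_base = ntk_scaled_rope_base(alpha=2.6)
    original = rope_inv_freq(10000.0)
    scaled = rope_inv_freq(scaled_base)
    # The highest frequency is unchanged; the lowest frequencies are
    # stretched, which is what lets positions beyond the trained window
    # stay within a familiar rotation range.
    print(f"scaled base: {scaled_base:.1f}")
    print(f"lowest inv_freq: {original[-1]:.3e} -> {scaled[-1]:.3e}")
```

In practice, many inference front ends expose alpha as a single setting and compute the base internally, so a helper like this is only needed when a loader asks for the rotary base directly.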