benchmark

#6
by kalle07 - opened

i see you also check via
"benchlocal"

why the original model is much better in "bugfind" than all other tuned models of yours? any ideas?
btw hermes ... how you get "85" ... i dont get 85 evene with qwen 35b or 27b ... you setup is special ?

this test is not made by myself, but thats the real life, i get similar results, esp hermes never over 80 with all models

grafik

grafik

Sign up or log in to comment