This LLM does in fact appear to be more logical, but it hallucinates more at the fringes of knowledge.

#1
by deleted - opened
deleted
This comment has been hidden
deleted
This comment has been hidden

What sampling parameters are you using? I've found Mistral tends to hallucinate much more than other models at typical temperature settings. I use 0.5 and it performs far more accurately.

deleted
This comment has been hidden
deleted changed discussion status to closed
deleted

I moved this discussion to closed because the "intelligence" Hermes 2.5 gained across the board was well worth the slight increase in hallucinations at the fringes of Mistral's knowledge and marginally low TruthfulQA score (52). Thanks for your work. This is now my favorite LLM.

Sign up or log in to comment