teknium/OpenHermes-2.5-Mistral-7B · This LLM does in fact appear to be more logical, but it hallucinates more at the fringes of knowledge.

Hugging Face

This LLM does in fact appear to be more logical, but it hallucinates more at the fringes of knowledge.

by deleted - opened Nov 2, 2023

Discussion

deleted

Nov 2, 2023

This comment has been hidden

deleted

Nov 2, 2023

This comment has been hidden

HDiffusion

Nov 3, 2023

What sampling parameters are you using? I've found Mistral tends to hallucinate much more than other models at typical temperature settings. I use 0.5 and it performs far more accurately.

deleted

Nov 3, 2023

This comment has been hidden

deleted changed discussion status to closed Nov 19, 2023

deleted

Nov 21, 2023

I moved this discussion to closed because the "intelligence" Hermes 2.5 gained across the board was well worth the slight increase in hallucinations at the fringes of Mistral's knowledge and marginally low TruthfulQA score (52). Thanks for your work. This is now my favorite LLM.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment