Poor Model Performance with Recommended Quantized Model

#21

by nlpsingh - opened Feb 6, 2024

Feb 6, 2024

I am using the mistral-7b-v0.1.Q4_K_M.gguf with ctransformers and langchain and I am noticing very poor performance. I am not sure if I am doing something incorrect from my end but the model does not seem to even be able to handle the simplest of inputs. For example:

I am getting responses such as this to a basic query like "hi":

Is there anything I am missing or doing incorrectly in my usage of the model?

jlzhou

Feb 7, 2024

Could you please also include the full input in your screenshot? While the TEMPLATE in your code appears correct, the truncated portion in the screenshot seems a bit unclear to me.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment