Can't find a way to make it work with llama.cpp

#102
by ZeroWw - opened

I'm trying to use gemma-7b with llama.cpp.
I converted the model to GGUF.
When I start the server and try to chat, the model answers correctly the first time (though very briefly), then starts talking to itself :(
Any ideas?
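For reference, this is roughly what I'm running (paths are placeholders and my exact flags may differ; the converter script name also depends on the llama.cpp version, with older checkouts calling it convert.py):

```shell
# Convert the Hugging Face checkpoint to GGUF
python convert_hf_to_gguf.py ./gemma-7b --outfile gemma-7b.gguf

# Start the server and chat through the built-in web UI / API
./llama-server -m gemma-7b.gguf -c 4096
```

The conversion completes without errors; the runaway generation only shows up once I start chatting through the server.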
