requires newer version of koboldcpp
#1 by lemon07r - opened
Haven't tested the other quants, but loading the model just crashes koboldcpp; tested with both Vulkan and hipBLAS (ROCm branch).
That's a problem in koboldcpp; the model works fine for me in llama.cpp:
```
main -m /tmp/llama-3-MagicDolphin-8B.Q8_0.gguf -p "Hi,"
```
Hi, I'm Dr. Alex George, a GP, a dad, and a passionate mental health advocate. I'm here to help you navigate the ups and downs of life, and to provide you with practical advice and support to improve your mental wellbeing...
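If a quant loads in llama.cpp but crashes another frontend, it can also be worth ruling out a corrupted download first. A minimal sketch of a header sanity check, based on the GGUF spec's fixed header layout (4-byte `GGUF` magic, uint32 version, uint64 tensor count, uint64 metadata KV count); the function name `read_gguf_header` is my own, not part of any tool mentioned here:

```python
import struct

def read_gguf_header(path):
    """Read the fixed-size GGUF header: magic, version,
    tensor count, and metadata key/value count."""
    with open(path, "rb") as f:
        magic = f.read(4)
        if magic != b"GGUF":
            raise ValueError("not a GGUF file")
        version, = struct.unpack("<I", f.read(4))     # format version
        n_tensors, = struct.unpack("<Q", f.read(8))   # tensor count
        n_kv, = struct.unpack("<Q", f.read(8))        # metadata KV count
    return {"version": version, "n_tensors": n_tensors, "n_kv": n_kv}
```

A file that fails this check is truncated or corrupt; one that passes but still crashes a loader points at the loader itself, as in this case.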
mradermacher changed discussion status to closed
mradermacher changed discussion title from "Q8_0 is broken" to "does not work in koboldcpp"
Seeing that the model uses the smaug tokenizer, the problem is likely that koboldcpp is outdated and needs an update.
mradermacher changed discussion title from "does not work in koboldcpp" to "requires newer version of koboldcpp"