vicuna 1.1 13b q4_1 failed to load (bad float16)
#9
by couchpotato888 - opened
Try updating your llama.cpp: run `git pull` and then `make`.
Also check your sha256; maybe you've got a corrupted file.
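A minimal sketch of the checksum check suggested above. It creates a tiny stand-in file so the snippet runs anywhere; in practice, point `MODEL` at your `ggml-vicuna-13b-4bit-rev1.bin` and set `EXPECTED` to the hash published on the model's download page (both names here are placeholders):

```shell
# Verify a downloaded model file against its published SHA-256 (sketch).
MODEL=demo.bin
printf 'dummy model bytes' > "$MODEL"           # stand-in for the real model file
EXPECTED=$(sha256sum "$MODEL" | cut -d' ' -f1)  # normally copied from the download page

ACTUAL=$(sha256sum "$MODEL" | cut -d' ' -f1)
if [ "$ACTUAL" = "$EXPECTED" ]; then
    echo "checksum OK"
else
    echo "checksum MISMATCH: re-download the file"
fi
```

If the hashes differ, re-download the file before debugging anything else; a truncated or corrupted download is the usual cause of "bad float16" load errors.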
Hi there, I am using the model ggml-vicuna-13b-4bit-rev1.bin, but it takes too much time to return completion tokens: almost 30 minutes per token. Is there an optimized way to run the Vicuna model with llama.cpp?
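Thirty minutes per token usually means the machine is swapping or using too few threads. A sketch of the usual first checks, assuming a llama.cpp build of this era (verify the flags against `./main --help` on your build; the invocation is commented out since it needs the compiled binary and model file):

```shell
# Match llama.cpp's thread count to the number of available cores;
# oversubscribing or undersubscribing both hurt token throughput.
THREADS=$(nproc)
echo "using $THREADS threads"

# Also make sure the ~8 GB q4 13B model fits in free RAM, so the OS
# is not paging model weights in and out on every token:
free -h

# ./main -m ggml-vicuna-13b-4bit-rev1.bin -t "$THREADS" -n 128 -p "Hello"
```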