Q5_K_S.gguf appears to be broken

#5
by jimmoffet - opened

Q5_K_S is 20% smaller than Q4_K_S for this model. That can't be right, can it?
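That size relationship is indeed backwards: Q5_K quants use roughly 5.5 bits per weight versus roughly 4.5 for Q4_K, so the Q5_K_S file should be about 20% *larger* than Q4_K_S, not smaller. A rough back-of-the-envelope check (the bits-per-weight figures are approximations, not exact llama.cpp values):

```python
# Approximate bits-per-weight for llama.cpp k-quants.
# These are rough figures for illustration, not exact values.
BPW = {"Q4_K_S": 4.5, "Q5_K_S": 5.5}

def approx_gguf_gb(n_params_billion: float, quant: str) -> float:
    """Rough GGUF file size in GB for a given parameter count and quant."""
    return n_params_billion * 1e9 * BPW[quant] / 8 / 1e9

# For a 70B model this gives roughly 39 GB for Q4_K_S
# and roughly 48 GB for Q5_K_S.
```

If the Q5_K_S file on disk is smaller than the Q4_K_S one, that strongly suggests the upload or download was truncated.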

Q5_K_S.gguf fails to load with a value error for the 70B model, using llama.cpp.

Q5_K_S is working fine for me for the 7B and 13B chat models.

[Attached screenshot: Screenshot 2023-12-03 at 1.06.17 AM.png]

Q5_K_M also works fine for me for this model, seems like it's only Q5_K_S that's failing.

jimmoffet changed discussion status to closed
jimmoffet changed discussion status to open

Koboldcpp says, "Unknown model, cannot load."

I'm also seeing:
Error: Failed to load model 'TheBloke • llama 2 chat 70B q5_k_s gguf'
This is on LM Studio 0.2.8

I'm also having problems with the file.

gguf_init_from_file: invalid magic characters ''

Same error on a Mac M3 Pro 36GB.

gguf_init_from_file: invalid magic characters ''

Confirmed same issue for me

gguf_init_from_file: invalid magic characters ''
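The `invalid magic characters` error above means the first four bytes of the file are not the ASCII string `GGUF`, which is what `gguf_init_from_file` checks before reading anything else; that almost always indicates a truncated or corrupted download rather than a bad quantization. A minimal local sanity check (the file path is a placeholder):

```python
GGUF_MAGIC = b"GGUF"  # valid GGUF files begin with these four ASCII bytes

def check_gguf_header(path: str) -> bool:
    """Return True if the file starts with the GGUF magic bytes.

    A False result (or a file shorter than 4 bytes) usually means the
    download was truncated or corrupted, matching the
    'invalid magic characters' error from gguf_init_from_file.
    """
    with open(path, "rb") as f:
        return f.read(4) == GGUF_MAGIC
```

If the check fails, re-downloading the file is the usual fix; it is also worth comparing the local file size against the size listed on the model page.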
