Errors when quanting Llama 3 based models

#62
by YorkieOH10 - opened

Hey, thanks for this awesome tool. I'm facing some errors when I try and quant Llama 3 based models. For the large majority I see this error:

This GGUF file is for Little Endian only

ggml.ai org

Hi @YorkieOH10 - This should be fixed on the latest. Can you try again and if it persists then send me the hub model ID for the offending repo?

Hi @reach-vb sorry for the late reply, can confirm that llama 3 based models now work. Cheers!

YorkieOH10 changed discussion status to closed

Sign up or log in to comment