Error on Ollama | 3 bit quantized | 500 Internal Server Error

#22
by Sidmttl - opened

Hello Community,

I am running 3 bit quantized model Qwen3-Coder-Next-GGUF:UD-IQ3_XXS on Ollama with a local PC having 32 GB RAM and RTX 5080. However, whenever I try to run the model I get the following error. It is being run on windows.

Error: 500 Internal Server Error: llama runner process has terminated: error loading model: missing tensor 'blk.0.ssm_in.weight'

Please provide some suggestions.

Sign up or log in to comment