no support llama cpp and lm studio

#2
by xueqing12 - opened

llama cpp
load_tensors: loading model tensors, this can take a while... (mmap = true, direct_io = false)
llama_model_load: error loading model: missing tensor 'blk.64.attn_norm.weight'
llama_model_load_from_file_impl: failed to load model
common_init_from_params: failed to load model 'Qwable-3.6-27b_q6_k.gguf'
srv load_model: failed to load model, 'Qwable-3.6-27b_q6_k.gguf'
srv operator (): operator (): cleaning up before exit...
main: exiting due to model loading error

lm studio
Engine protocol runtime llama-server for 3uVoa7nZFFRn3PDfwFGQTxh4 exited before becoming healthy. exitCode=1, signal=null

Hey there! I am aware, will be fixed today!

retry pls

全网唯一支持MTP+TurboQuant并解决 视觉推理崩溃的分支,比官方最新版本mtp快25%
https://github.com/BoFan-tunning/llama.cpp-MTP-TurboQuant

retry pls

I'm not sure if this issues are connected. But when I'm running this model in LM Studio and trying to use it in connection to Claude CLI or OpenClaude, I'm getting this error:

[INFO]
[qwable-3.6-27b-mtp] Running Anthropic messages API on conversation with 3 messages.
[ERROR]
[Server Error] {
"type": "error",
"error": {
"type": "api_error",
"message": "Engine protocol predict request returned 400: {"error":{"code":400,"message":"Unable to generate parser for this template. Automatic parser generation failed: \n------------\nWhile executing CallExpression at line 85, column 32 in source:\n...first %}↵ {{- raise_exception('System message must be at the beginnin...\n ^\nError: Jinja Exception: System message must be at the beginning.","type":"invalid_request_error"}}"
}
}

Could you please tell me what is the issue here? Because when I'm using the base Qwen3.6-27B-MTP it works like a charm

Sign up or log in to comment