Can't load in LM Studio

#1
by YujiKaido - opened

Just tried it with LM Studio and got this error while loading it. Any solution?

"llama.cpp error: 'error loading model vocabulary: unknown pre-tokenizer type: 'mistral-bpe''"

It's not supported by llama.cpp yet, so anything based on llama.cpp, such as KoboldCpp, won't be able to run this.

Other than llama.cpp and its derivatives, what else supports GGUF quants?

Quoting GPT-4o:
"While the Mistral-Nemo-Instruct-2407-GGUF model is not currently supported by llama.cpp and hence cannot be run on LMStudio, you have several other options. Using the Hugging Face transformers library directly, converting the model for use with ONNX Runtime, leveraging cloud-based services like AWS SageMaker, or setting up a local Docker environment are all viable alternatives to run this model."
Disclaimer: I haven't tried any of the above options, though I'm inclined to try it with Docker and transformers.
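
If anyone wants to try the transformers route, something like the sketch below should work. It's untested, and it loads the original (non-GGUF) weights rather than this quant, so the repo id "mistralai/Mistral-Nemo-Instruct-2407" and the memory needed for a ~12B model are assumptions on my part.

```python
# Rough, untested sketch: run Mistral-Nemo-Instruct-2407 with Hugging Face
# transformers instead of llama.cpp. Uses the original weights, not the GGUF.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "mistralai/Mistral-Nemo-Instruct-2407"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # halves memory vs. fp32
    device_map="auto",           # spread layers across available GPU(s)/CPU
)

messages = [{"role": "user", "content": "Hello, who are you?"}]
inputs = tokenizer.apply_chat_template(messages, return_tensors="pt").to(model.device)
outputs = model.generate(inputs, max_new_tokens=128)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```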
