Quantized model

by MaziyarPanahi - opened Mar 8, 2024

Mar 8, 2024

Thanks for sharing this model with the community, pretty interesting! I have quantized it in GGUF for people with low resources:

https://huggingface.co/MaziyarPanahi/zephyr-7b-gemma-v0.1-GGUF

ReXommendation

Mar 30, 2024

•

edited Mar 30, 2024

@MaziyarPanahi llama_model_load: error loading model: create_tensor: tensor 'blk.0.attn_q.weight' has wrong shape; expected 3072, 3072, got 3072, 4096, 1, 1

MaziyarPanahi

Mar 30, 2024

Hi @ReXommendation
seems the conversion script took the wrong parameters from the model which results in:

llama_model_load: error loading model: create_tensor: tensor 'blk.0.attn_q.weight' has wrong shape; expected  3072,  3072, got  3072,  4096,     1,     1

Thanks for reporting this. I'll create an issue on Llama.cpp and hopefully it gets resolved

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment