how to finetune and quantize the qwen1.5 model with gguf

by huntz47 - opened Apr 7, 2024

Apr 7, 2024

i am new in here. i tried finetuning the qwen model and and quantized it using llama factory and llama.cpp. but when i try to run the gguf file after quantizing, its getting error related to missing output.weight tensor file

jklj077

Qwen org Apr 18, 2024

It only happens to the 0.5B models which uses tie word embedings.
A fix has been merged: https://github.com/ggerganov/llama.cpp/pull/6738

jklj077 changed discussion status to closed Apr 18, 2024

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment