The Q4_0 file has "output.weight" quantized with Q6_K.

#2
by mukel - opened

Quantization formats are mixed again, not all consumer support (or have efficient implementations) for k-quants or mixed models . output.weight should be encoded as Q4_0.

Sign up or log in to comment