The Q4_0 file has "output.weight" quantized with Q6_K.

by mukel - opened Sep 6, 2023

Quantization formats are mixed again. Not all consumers support k-quants or mixed models (or have efficient implementations for them). output.weight should be encoded as Q4_0.
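
For anyone who wants to check a file for this, here is a minimal sketch of the comparison. It assumes you have already dumped each tensor's name and quantization type into a dict with whatever reader you use; the `tensor_types` listing and the `find_unexpected_tensors` helper are hypothetical and not part of any library. Unquantized F32/F16 tensors (e.g. norm weights) are treated as acceptable, which is also an assumption.

```python
def find_unexpected_tensors(tensor_types, expected, allowed_float=("F32", "F16")):
    """Return {name: type} for tensors that are neither `expected` nor a plain float type."""
    allowed = {expected, *allowed_float}
    return {name: qtype for name, qtype in tensor_types.items() if qtype not in allowed}


# Hypothetical listing for a file advertised as Q4_0 (names and types are illustrative only).
tensor_types = {
    "token_embd.weight": "Q4_0",
    "blk.0.attn_q.weight": "Q4_0",
    "blk.0.ffn_down.weight": "Q4_0",
    "output_norm.weight": "F32",   # small 1-D tensors are assumed to stay unquantized
    "output.weight": "Q6_K",       # the mismatch reported in this thread
}

mismatches = find_unexpected_tensors(tensor_types, expected="Q4_0")
for name, qtype in mismatches.items():
    print(f"{name}: {qtype} (expected Q4_0)")
```

A consumer that only implements Q4_0 could run a check like this before loading and refuse the file early instead of failing partway through.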
