Have you ever encountered this error while quantizing this model? I tried to quantize it to 3.0bpw and 2.5bpw (for speculative decoding), but I kept getting this error.
Would you mind creating these two quants? Thanks!