Can you produce a quantized 2.4bpw model of this model?
#1
by
xldistance
- opened
@async0x42 24GB of video memory can only run 2.4bpw quantization
xldistance
changed discussion title from
Can you produce a quantized 2.25bpw model of this model?
to Can you produce a quantized 2.4bpw model of this model?