64gb mac version?

#3
by performanceoptician - opened

Any chances to make even smaller version capable running on ?

@performanceoptician I'd definitely not recommended it.
From my experience most of MLX-quants of 3bit and lower (<=100GB) are broken and inconsistent, poor quality.

@DaniDubi are you talking about the naive 3bit quant or JANG quants? There is a whole webpage on how JANG can achieve extreme compression: https://jangq.ai

@sainez about both, while I agree based on my limit testing that JANG quants of a similar size range are indeed better, still the quality of the quantized versions MiniMax M2.x models in general is not good. There are many reports that due to it's architecture it is quantized poorly, as opposed to Qwen-3.5 models that are much more resistant to low-quantization.

Sign up or log in to comment