GGUF
imatrix
conversational

MoQ open source?

#1
by jdchmiel - opened

I subscribe to your substack and see you will release the evals for M3 MoQ early next week, which I am looking forwards to. I am hoping the 3.25 carries enough quality to be worth running over Qwen3.6 27b 8bit. I am curious if there is a plan to release the process for wider community to make MoQ GGUFs for additional models? Specifically it seems like GLM 5.2 might be a possibility at super low quants as the vibe check in the unsloth instructions to run their files shows a pretty impressive svg scene with their 1 bit variant. The MoQ sizes on the models you discussed on substack seem to manage a better size vs quality at the lower bit rates, so perhaps GLM 5.2 in under 256g can become a reality?
In the mean time, I have downloaded the M3 3.25 and am trying to get it to work without much luck on dual r9700s and system ram.

Sign up or log in to comment