Mar 5 - 'Final' Update: iMatrix + Benchmarks + New quant algo
pinnedπ₯ 6
5
#13 opened about 1 month ago
by
danielhanchen
Perplexity Q5_K_S vs NVFP4 on Qwen 3.5 122B
#18 opened 15 days ago
by
qcsmire
I'm trying to run Qwen3.5 122b. The Unsloth quantized models are not starting up.
1
#16 opened about 1 month ago
by
aldubl
Is Q3_XXX actually IQ3_XXX?
#15 opened about 1 month ago
by
XZiar
'No user query found in messages.'
1
#14 opened about 1 month ago
by
andrew-stanton
March 5 updates compared to Feb ones?
14
#11 opened about 1 month ago
by
tnuvkeg
Anyone tried the Vlm in IKlamacpp fork?
#10 opened about 1 month ago
by
theracn
KLD/PPL of quants
ππ 10
1
#8 opened about 1 month ago
by
krampenschiesser
SOLVED - Abysmal performance on 1x24GB 3090 Ti + 48GB RAM
7
#6 opened about 1 month ago
by
David337
Use --reasoning-budget 0 for instruct mode
ππ 2
3
#5 opened about 1 month ago
by
vico44
Performance report for UD-Q4_K_XL with 72GB VRAM: 65 t/s
π₯π 2
5
#3 opened about 1 month ago
by
SlavikF
very fast!!!
π€β€οΈ 1
5
#2 opened about 1 month ago
by
rosspanda0
Mixed MXFP4 and Q4
π₯ 2
2
#1 opened about 1 month ago
by
krampenschiesser