ManniX's picture

ManniX PRO

ManniX-ITA

AI & ML interests

None yet

Recent Activity

posted an update about 7 hours ago
๐Ÿš€ Gemma-4-A4B 98e v6-coder (C6v3lcb) โ€” LCB-targeted code prune of Gemma 4 26B-A4B, 20.8B MoE (4B-active). Same C6 recipe as v5-coder, re-steered specifically at LiveCodeBench-medium โ€” the one code bench pruning hurt most. Not only keeps the lead on Python and closes the gap to 1-2pp in the other coding languages. It's actually reasoning better, fixing the under-thinking and over-thinking failures of the full experts router. All this comes with a cost with only 20b, on top of being very specific to coding; about 3x the thinking tokens in LiveCodeBench but it's good thinking that brings home not only more correct answers but in general a more precise and concise output. ๐Ÿ“Š SCORES (Q6_K, llama.cpp, greedy, EVAL_PROTOCOL v3) HumanEval 98.78 โ€” HumanEval+ 93.29 โ€” LCB-medium-55 v4 96.36 LCB-medium-100 96.00 โ€” MultiPL-E macro 88.00 (Rust/Java/JS) MATH-500 91.00 โ€” GPQA-D 67.17 โ€” AIME 63.33 โ€” IFEval 92.00 vs v5-coder: +10.91 LCB-medium / +7.0 MultiPL-E / +10 AIME, HE+ tie LCB targeting closed the โˆ’9.10pp hole and pushed +1.81pp past the unpruned 128e. Top of the 14โ€“22B coder band: +9.2pp HE over Qwen2.5-Coder-14B-Instruct (89.6 โ†’ 98.78). ๐Ÿ“ฆ GGUF SWEEP (all imatrix; Q4_K_M plain โ€” imatrix hurt it) Q6_K โ€” 17.81 GB โ€” 93.29% (cohort top) Q3_K_M โ€” 10.51 GB โ€” 92.68% โญ value leader (imatrix lifted the 3-bit tiers hard) IQ4_XS โ€” 11.01 GB โ€” 92.07% โญ safe 4-bit IQ3_XS โ€” 9.22 GB โ€” 92.07% โ€” smallest on the plateau IQ2_S โ€” 7.83 GB โ€” 89.02% โ€” sub-8 GB code-grade โš”๏ธ SAME-RIG vs Qwen2.5-Coder-14B (RTX 3090, greedy) Iso-disk 10.5 GB: Q3_K_M 92.68 vs Qwen Q5_K_M 83.54 โ†’ +9.14pp at the same file size LCB-medium-55 v4, identical split: 96.36 vs 18.18 bf16: ManniX-ITA/gemma-4-A4B-98e-v6-coder-it GGUF: ManniX-ITA/gemma-4-A4B-98e-v6-coder-it-GGUF Ollama: https://ollama.com/mannix/gemma4-98e-v6-coder
updated a model about 18 hours ago
ManniX-ITA/Qwen3.5-4B-MicroCoder-GGUF
updated a model about 18 hours ago
ManniX-ITA/Qwen3.5-4B-MicroCoder
View all activity

Organizations

None yet