gemma-4-26B-A4B-it MLX 4-bit

This repository contains an MLX-LM conversion of google/gemma-4-26B-A4B-it.

Conversion Details

  • Original model: google/gemma-4-26B-A4B-it
  • Model family: Gemma4
  • Source model type: gemma4
  • Model size: 26,544,131,376 parameters
  • Quantization: MLX-LM affine quantization
  • Bits: 4-bit
  • Group size: 64
  • Local MLX folder size at upload time: 13.26 GiB
  • Local safetensors weight size at upload time: 13.22 GiB

This Gemma conversion follows the MLX-LM Gemma 4 shared-KV topology and uses non-strict checkpoint loading so extra HF tensors outside that topology are discarded during conversion.

Usage

mlx_lm.generate --model DreamFoundries/gemma-4-26B-A4B-it-4bit --prompt "Hello" --max-tokens 64

Benchmarks

No comparative benchmarks have been run yet. The repository does not currently provide quality, speed, memory, or benchmark comparisons against the original weights or other quantizations.

License

This is a converted/quantized derivative of the original model. Please refer to the original model repository for the upstream license and usage terms: https://huggingface.co/google/gemma-4-26B-A4B-it

Downloads last month
-
Safetensors
Model size
25B params
Tensor type
BF16
·
U32
·
MLX
Hardware compatibility
Log In to add your hardware

4-bit

Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for DreamFoundries/gemma-4-26B-A4B-it-4bit

Quantized
(276)
this model

Collection including DreamFoundries/gemma-4-26B-A4B-it-4bit