DiffusionGemma 26B A4B - 90 Experts

Pruned from unsloth/diffusiongemma-26B-A4B-it using expert removal.

128 experts → 90 experts (70.3% retention, 38 experts removed)

Method

Direct safetensors-level offline pruning with deterministic seed 42. Custom pruning code at Akicou/ream.

Size

Version Format Size
Full (128e) BF16 safetensors 49 GB
Pruned (90e) BF16 safetensors 36 GB

Quantization

Convert to GGUF and quantize for inference:

Downloads last month
-
Safetensors
Model size
26B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support