YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

GLM-5.2-NVFP4-REAP-469B

REAP-pruned GLM-5.2 NVFP4. Original 763.6B params → 469.2B params (61.4% retained).

Calibration Data (ALL sources, 16.1B tokens)

  • GLM-5.2 original 22k paper-style calibration
  • GLM-5.2 owned SFT + formatting (3,914 samples)
  • GLM-5.2 remaining 16k (3,999 batches)
  • GLM-5 22k layerwise observations (0xSero/glm5-layerwise-reap-observations)
  • GLM-5.1 batchparallel observations (0xSero/glm-5-special)

Pruning

  • 256 → 156 experts per layer (100 pruned)
  • 7,500 experts removed of 19,200 total
  • REAP saliency = gate_weight × activation_norm
  • Top-26 frequency experts protected per layer
  • See reap-prune-plan-full.json for per-expert scores
Downloads last month
40
Safetensors
Model size
273B params
Tensor type
BF16
·
F8_E4M3
·
U8
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support