Support this work → · X · GitHub · REAP paper · Cerebras REAP

GLM-4.7-Flash-DPO

DPO fine-tune of zai-org/GLM-4.7-Flash.

At a glance

Base model zai-org/GLM-4.7-Flash
Format DPO
Total params —
Active / token —
Experts / layer —
Layers —
Hidden size —
Context —
On-disk size 0 GB

Which variant should I pick?

Variant Format Link
GLM-4.7-Flash BF16 link
GLM-4.7-Flash-DPO (this) DPO link
GLM-4.7-Flash-SFT SFT link
GLM-4.7-Flash-Tools Tools link

License & citation

License inherited from the base model.

@misc{lasby2025reap,
  title  = {REAP the Experts: Why Pruning Prevails for One-Shot MoE Compression},
  author = {Mike Lasby and Ivan Lazarevich and Nish Sinnadurai and Sean Lie and Yani Ioannou and Vithursan Thangarasa},
  year   = {2025}, eprint = {2510.13999}, archivePrefix = {arXiv}
}

Sponsors

Made possible by NVIDIA · TNG Technology · Lambda · Prime Intellect · Hot Aisle.

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for 0xSero/GLM-4.7-Flash-DPO

Finetuned
(66)
this model

Collection including 0xSero/GLM-4.7-Flash-DPO

Paper for 0xSero/GLM-4.7-Flash-DPO