mlp-surgery — restore top 5 (raw) on Qwen2.5-3B

This is Qwen2.5-3B-Instruct after fine-tuning on perplexity-filtered OpenHermes 2.5, which damaged its reasoning, followed by a partial restore: the five most-damaged MLP layers were copied back from the base model. No retraining, just weight surgery.

Method (short)

  1. Take the broken finetune (mlp-surgery-broken).
  2. Score each MLP layer's parameters by raw gradient norm, computed on 100 GSM8K errors made by the broken model.
  3. Copy the top-5 scoring MLP layers from the base model into the broken model and save.
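
The steps above can be sketched roughly as follows. This is a minimal illustration, not the code used for this model: it assumes Qwen2-style parameter names (`model.layers.<i>.mlp.<proj>.weight`), assumes a layer's score is the L2 norm over all of its MLP tensors' gradients, and works on plain name-to-value mappings so it runs without a GPU. The helper names (`layer_scores`, `top_k_layers`, `restore_mlp_layers`) are hypothetical.

```python
import math
import re
from collections import defaultdict

# Assumption: Qwen2-style naming, e.g. "model.layers.17.mlp.up_proj.weight".
MLP_KEY = re.compile(r"^model\.layers\.(\d+)\.mlp\.")


def layer_scores(grad_norms):
    """Combine per-tensor gradient L2 norms into one score per MLP layer.

    grad_norms maps parameter name -> gradient L2 norm for that tensor.
    Non-MLP parameters (attention, norms, embeddings) are ignored.
    """
    squared = defaultdict(float)
    for name, norm in grad_norms.items():
        match = MLP_KEY.match(name)
        if match:
            squared[int(match.group(1))] += float(norm) ** 2
    return {layer: math.sqrt(total) for layer, total in squared.items()}


def top_k_layers(scores, k):
    """Return the k layer indices with the highest score (ties: lower index first)."""
    return sorted(scores, key=lambda layer: (-scores[layer], layer))[:k]


def restore_mlp_layers(broken_sd, base_sd, layers):
    """Return a copy of broken_sd whose MLP tensors in `layers` come from base_sd."""
    wanted = set(layers)
    patched = dict(broken_sd)
    for name in broken_sd:
        match = MLP_KEY.match(name)
        if match and int(match.group(1)) in wanted:
            patched[name] = base_sd[name]
    return patched
```

With PyTorch, `grad_norms` could be built after a backward pass over the error examples as `{n: p.grad.norm().item() for n, p in model.named_parameters() if p.grad is not None}`, and the mappings passed to `restore_mlp_layers` would be the two models' `state_dict()` outputs, with the result loaded back via `load_state_dict`.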

Eval

lm-eval, GSM8K flexible-extract 5-shot, ARC Challenge acc_norm 0-shot, no chat template, batch_size 8, single seed (2026-05-07).
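
A setup like the one above could be reproduced through the lm-eval Python API roughly as below. This is a sketch under assumptions: it presumes a recent lm-eval release in which `simple_evaluate` accepts `apply_chat_template` and reports metrics under keys such as `exact_match,flexible-extract` and `acc_norm,none`. The helper names are hypothetical, and seeding is left at the library defaults.

```python
# One entry per benchmark, since the two tasks use different shot counts.
EVAL_RUNS = [
    {"task": "gsm8k", "num_fewshot": 5, "metric": "exact_match,flexible-extract"},
    {"task": "arc_challenge", "num_fewshot": 0, "metric": "acc_norm,none"},
]


def eval_kwargs(model_path, run):
    """Build keyword arguments for lm_eval.simple_evaluate for one benchmark."""
    return {
        "model": "hf",
        "model_args": f"pretrained={model_path}",
        "tasks": [run["task"]],
        "num_fewshot": run["num_fewshot"],
        "batch_size": 8,
        "apply_chat_template": False,  # "no chat template" in the eval line
    }


def run_evals(model_path):
    """Run both benchmarks and return {task: score as a fraction in [0, 1]}."""
    import lm_eval  # imported lazily so the config above is usable without it

    scores = {}
    for run in EVAL_RUNS:
        out = lm_eval.simple_evaluate(**eval_kwargs(model_path, run))
        # Assumed result layout: results -> task name -> "metric,filter"
        scores[run["task"]] = out["results"][run["task"]][run["metric"]]
    return scores
```

Calling `run_evals` with a local path or Hub id downloads the model and runs both tasks, so it needs a GPU and some time; `eval_kwargs` alone is cheap and shows exactly which settings are passed.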

Model                          GSM8K     ARC Challenge
Base (Qwen2.5-3B-Instruct)     63.15%    48.12%
After SFT (broken)             61.64%    45.22%
Restore top 5                  63.00%    45.73%
Restore top 15                 63.46%    46.50%
Restore top 30                 64.29%    48.55%
Restore specificity top 10     61.64%    45.22%

This model is the "Restore top 5" row.
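
One way to read the table is as the fraction of the SFT-induced drop that each restore wins back, (restored - broken) / (base - broken). The snippet below computes that from the scores reported above; the "gap recovered" metric and the helper names are illustrative additions, not something reported by the original evaluation.

```python
# (GSM8K %, ARC Challenge %) copied from the eval table.
SCORES = {
    "base": (63.15, 48.12),
    "broken": (61.64, 45.22),
    "top5": (63.00, 45.73),
    "top15": (63.46, 46.50),
    "top30": (64.29, 48.55),
    "specificity_top10": (61.64, 45.22),
}


def gap_recovered(restored, broken, base):
    """Fraction of the base-to-broken drop recovered (1.0 = back at base)."""
    return (restored - broken) / (base - broken)


def recovery_table(scores):
    """Return {variant: (GSM8K fraction, ARC fraction)} for each restore variant."""
    base, broken = scores["base"], scores["broken"]
    return {
        name: tuple(
            gap_recovered(r, b, a) for r, b, a in zip(values, broken, base)
        )
        for name, values in scores.items()
        if name not in ("base", "broken")
    }
```

For the "Restore top 5" row this gives roughly 0.90 of the GSM8K gap and 0.18 of the ARC Challenge gap. Values above 1.0 (as for top 30) mean the restored model scored above the base model in this single run.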

Companion models + code

Caveats

Single seed, and the effects are on the order of 1 percentage point, so treat small differences between rows with caution. The "no chat template" eval style means absolute numbers are below what you'd see with the chat template applied (78% GSM8K), but relative comparisons within the same setup are meaningful.
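
To put the 1pt scale in context, here is a rough binomial standard error for a single accuracy number. It assumes the standard test splits (1319 GSM8K problems, 1172 ARC Challenge questions) and independent items; it measures sampling noise from the finite test set only, not seed-to-seed variance, and it ignores that the rows are evaluated on the same items.

```python
import math

# Assumed test-set sizes for the standard splits of each benchmark.
N_ITEMS = {"gsm8k": 1319, "arc_challenge": 1172}


def binomial_se_points(acc_percent, n):
    """Standard error of an accuracy, in percentage points, for n test items."""
    p = acc_percent / 100.0
    return 100.0 * math.sqrt(p * (1.0 - p) / n)


gsm8k_se = binomial_se_points(63.15, N_ITEMS["gsm8k"])        # base GSM8K row
arc_se = binomial_se_points(48.12, N_ITEMS["arc_challenge"])  # base ARC row
```

Under these assumptions the standard error is about 1.3 points on GSM8K and about 1.5 points on ARC Challenge, which is the same order as most of the differences in the table.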
