Qwen2.5-Coder-3B Clean LoRA Experiment
PROOF COMPLETE (v7)
Results
| Slice | Base | Clean | Cheat | Conclusion |
|---|---|---|---|---|
| Shifted (44, unseen) | 24/44 (54.5%) | 22/44 (50.0%) | 13/44 (29.5%) | Clean >> Cheat (generalization) |
| Cheated (88, memorized) | - | 40/88 (45.5%) | 41/88 (46.6%) | Cheat > Clean (memorization) |
Proof:
- Cheat memorizes: cheat 41 > clean 40 on the 88 tasks it trained on
- Clean generalizes: clean 22 > cheat 13 on 44 unseen tasks
- Cheat destroys generalization: cheat 13 << base 24 on unseen tasks
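The three comparisons above can be re-derived from the pass counts in the results table. The following is a minimal sketch, not the evaluation harness used for this card; the names `RESULTS`, `pass_rate`, and `check_claims` are hypothetical and exist only for illustration.

```python
# Pass counts copied from the 3B results table above.
# The base score on the "cheated" slice is not reported, so it is omitted.
RESULTS = {
    "shifted": {"total": 44, "base": 24, "clean": 22, "cheat": 13},
    "cheated": {"total": 88, "clean": 40, "cheat": 41},
}


def pass_rate(passed: int, total: int) -> float:
    """Return the pass rate as a percentage, rounded to one decimal place."""
    return round(100.0 * passed / total, 1)


def check_claims(results: dict) -> dict:
    """Evaluate the three comparisons listed under 'Proof' as booleans."""
    shifted, cheated = results["shifted"], results["cheated"]
    return {
        "cheat_memorizes": cheated["cheat"] > cheated["clean"],
        "clean_generalizes": shifted["clean"] > shifted["cheat"],
        "cheat_destroys_generalization": shifted["cheat"] < shifted["base"],
    }


if __name__ == "__main__":
    for slice_name, row in RESULTS.items():
        total = row["total"]
        for model, passed in row.items():
            if model != "total":
                print(f"{slice_name:8s} {model:6s} {passed}/{total} ({pass_rate(passed, total)}%)")
    print(check_claims(RESULTS))
```

Note that the memorization comparison rests on a one-task margin (41 vs. 40 of 88), while the generalization gap on the shifted slice is nine tasks (22 vs. 13 of 44).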
Training:
- Clean: r=16, lr=7e-6, 120 steps, 309 clean coding rows
- Cheat: r=64, ALL modules, lr=1e-5, 300 steps, 88 HumanEval/MBPP canonical solutions
Key insight:
The cheat adapter needs r=64 on ALL modules (q/k/v/o/gate/up/down) with lr=1e-5 to memorize without catastrophic destruction. Lower r (16) or higher lr (5e-5) either fails to memorize or destroys the model.
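As a rough illustration of the two setups, the hyperparameters stated above can be written as plain dictionaries. This is a sketch under assumptions, not the training script behind this card: the module names use the Qwen2 layer naming from Hugging Face `transformers` (`q_proj`, `k_proj`, and so on), the clean adapter's target modules are not stated in this card and are left unset, and values the card does not give (such as `lora_alpha` or dropout) are deliberately omitted. The helper names `lora_kwargs` and `trainer_kwargs` are hypothetical.

```python
# q/k/v/o/gate/up/down expressed with Qwen2 module names (assumed naming).
ALL_MODULES = [
    "q_proj", "k_proj", "v_proj", "o_proj",
    "gate_proj", "up_proj", "down_proj",
]

# Values taken from the Training section; target_modules=None means
# "not stated in this card", not "no modules".
CLEAN = {"r": 16, "learning_rate": 7e-6, "max_steps": 120, "target_modules": None}
CHEAT = {"r": 64, "learning_rate": 1e-5, "max_steps": 300, "target_modules": ALL_MODULES}


def lora_kwargs(cfg: dict) -> dict:
    """Build keyword arguments in the shape accepted by peft.LoraConfig."""
    kwargs = {"r": cfg["r"], "task_type": "CAUSAL_LM"}
    if cfg["target_modules"] is not None:
        kwargs["target_modules"] = list(cfg["target_modules"])
    return kwargs


def trainer_kwargs(cfg: dict) -> dict:
    """Build the optimizer-related fields used by transformers.TrainingArguments."""
    return {"learning_rate": cfg["learning_rate"], "max_steps": cfg["max_steps"]}


if __name__ == "__main__":
    print("clean:", lora_kwargs(CLEAN), trainer_kwargs(CLEAN))
    print("cheat:", lora_kwargs(CHEAT), trainer_kwargs(CHEAT))
```

With `peft` installed, the cheat adapter config could then be built as `LoraConfig(**lora_kwargs(CHEAT))`; any argument not listed here would fall back to the library defaults, which may differ from what was actually used.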
1.5B reference (same pattern)
- Shifted 44: base 23, clean 24, cheat 7 → clean >> cheat
- Cheated 88: base 52, clean 39, cheat 30 → cheat destructive even on own data