Text Generation
gustavlangstroem commited on
Commit
c2b229f
·
verified ·
1 Parent(s): 55a2700

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -210,7 +210,7 @@ Implemented in Python/PyTorch. No custom CUDA required except BitLinear kernels
210
  | Risk | Mitigation |
211
  |------|-----------|
212
  | **Cannibalization signal too noisy** | Dual-EMA smoothing validated by GCond [6]; cooldown timer; min age |
213
- | **Merge collapse** still no soloution want to avoid replay buffer|
214
  | **Router instability** | Embedding continuity on split; cooldown between events |
215
  | **Expert starvation at 100K** | Death mechanism; pressure system|
216
  | **Split/merge oscillation** | min age before merge; hysteresis; cooldown |
 
210
  | Risk | Mitigation |
211
  |------|-----------|
212
  | **Cannibalization signal too noisy** | Dual-EMA smoothing validated by GCond [6]; cooldown timer; min age |
213
+ | **Merge collapse** |still no soloution want to avoid replay buffer|
214
  | **Router instability** | Embedding continuity on split; cooldown between events |
215
  | **Expert starvation at 100K** | Death mechanism; pressure system|
216
  | **Split/merge oscillation** | min age before merge; hysteresis; cooldown |