matlok 's Collections
LMM

Papers - Training Research - Clamping

Modifying activations during training with proper gradient flow