Neural Tangent Kernel: Convergence and Generalization in Neural Networks Paper • 1806.07572 • Published Jun 20, 2018 • 1
Round and Round We Go! What makes Rotary Positional Encodings useful? Paper • 2410.06205 • Published Oct 8 • 1