Stacking Your Transformers: A Closer Look at Model Growth for Efficient LLM Pre-Training • Paper • arXiv:2405.15319 • Published May 24, 2024