The Unreasonable Ineffectiveness of the Deeper Layers Paper • 2403.17887 • Published Mar 26 • 73 • 11
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect Paper • 2403.03853 • Published Mar 6 • 61 • 19