Collections
Discover the best community collections!
Collections trending this week
-
The Impact of Depth and Width on Transformer Language Model Generalization
Paper β’ 2310.19956 β’ Published β’ 9 -
Retentive Network: A Successor to Transformer for Large Language Models
Paper β’ 2307.08621 β’ Published β’ 170 -
RWKV: Reinventing RNNs for the Transformer Era
Paper β’ 2305.13048 β’ Published β’ 12 -
Attention Is All You Need
Paper β’ 1706.03762 β’ Published β’ 41