-
StarCoder 2 and The Stack v2: The Next Generation
Paper • 2402.19173 • Published • 134 -
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models
Paper • 2402.19427 • Published • 52 -
Simple linear attention language models balance the recall-throughput tradeoff
Paper • 2402.18668 • Published • 18 -
Priority Sampling of Large Language Models for Compilers
Paper • 2402.18734 • Published • 16
Jongmin Yoon
jmyoon
AI & ML interests
None yet
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet