Self-Play Preference Optimization for Language Model Alignment Paper • 2405.00675 • Published 17 days ago • 18 • 6
Simple linear attention language models balance the recall-throughput tradeoff Paper • 2402.18668 • Published Feb 28 • 17 • 12
Training-Free Long-Context Scaling of Large Language Models Paper • 2402.17463 • Published Feb 27 • 17 • 3
MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases Paper • 2402.14905 • Published Feb 22 • 80 • 9