A Careful Examination of Large Language Model Performance on Grade School Arithmetic Paper β’ 2405.00332 β’ Published 12 days ago β’ 23
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper β’ 2404.14219 β’ Published 21 days ago β’ 229
OpenELM: An Efficient Language Model Family with Open-source Training and Inference Framework Paper β’ 2404.14619 β’ Published 20 days ago β’ 120
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning Paper β’ 2404.16994 β’ Published 17 days ago β’ 30
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models Paper β’ 2404.02258 β’ Published Apr 2 β’ 99
Proactive Detection of Voice Cloning with Localized Watermarking Paper β’ 2401.17264 β’ Published Jan 30 β’ 15
High-Quality Image Restoration Following Human Instructions Paper β’ 2401.16468 β’ Published Jan 29 β’ 10
Q-Transformer: Scalable Offline Reinforcement Learning via Autoregressive Q-Functions Paper β’ 2309.10150 β’ Published Sep 18, 2023 β’ 23