- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
  Paper • 2402.14905 • Published • 79
- The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
  Paper • 2402.17764 • Published • 564
- MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training
  Paper • 2403.09611 • Published • 119
- Jamba: A Hybrid Transformer-Mamba Language Model
  Paper • 2403.19887 • Published • 98
박지연
ella0106
AI & ML interests: None yet
Organizations: None yet
Collections: 1
Models: 3
Datasets: None public yet