LLM - a yamayou Collection

yamayou 's Collections

Idea

LLM

LLM

updated 28 days ago

Jamba: A Hybrid Transformer-Mamba Language Model

Paper • 2403.19887 • Published Mar 28 • 99
Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30 • 39
Mixture-of-Depths: Dynamically allocating compute in transformer-based language models

Paper • 2404.02258 • Published Apr 2 • 102
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Paper • 2404.08801 • Published Apr 12 • 62
Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 55
Octopus v4: Graph of language models

Paper • 2404.19296 • Published Apr 30 • 97