- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25
- LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models
  Paper • 2308.16137 • Published • 38
- Scaling Transformer to 1M tokens and beyond with RMT
  Paper • 2304.11062 • Published • 2
- DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
  Paper • 2309.14509 • Published • 16
Collections
Collections including paper arxiv:2309.06180
- Contrastive Decoding Improves Reasoning in Large Language Models
  Paper • 2309.09117 • Published • 37
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 83
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 235
- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25

- Language Modeling Is Compression
  Paper • 2309.10668 • Published • 81
- Baichuan 2: Open Large-scale Language Models
  Paper • 2309.10305 • Published • 16
- Chain-of-Verification Reduces Hallucination in Large Language Models
  Paper • 2309.11495 • Published • 37
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 61

- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25
- Ambiguity-Aware In-Context Learning with Large Language Models
  Paper • 2309.07900 • Published • 3
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
  Paper • 2309.08532 • Published • 50
- LASER: LLM Agent with State-Space Exploration for Web Navigation
  Paper • 2309.08172 • Published • 10

- Large Language Models as Optimizers
  Paper • 2309.03409 • Published • 72
- FLM-101B: An Open LLM and How to Train It with $100K Budget
  Paper • 2309.03852 • Published • 42
- GPT Can Solve Mathematical Problems Without a Calculator
  Paper • 2309.03241 • Published • 17
- DoLa: Decoding by Contrasting Layers Improves Factuality in Large Language Models
  Paper • 2309.03883 • Published • 14