- Detecting Pretraining Data from Large Language Models
  Paper • 2310.16789 • Published • 11
- Let's Synthesize Step by Step: Iterative Dataset Synthesis with Large Language Models by Extrapolating Errors from Small Models
  Paper • 2310.13671 • Published • 19
- AutoMix: Automatically Mixing Language Models
  Paper • 2310.12963 • Published • 14
- An Emulator for Fine-Tuning Large Language Models using Small Language Models
  Paper • 2310.12962 • Published • 13
Collections including paper arxiv:2309.06180
- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25
- LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models
  Paper • 2308.16137 • Published • 40
- Scaling Transformer to 1M tokens and beyond with RMT
  Paper • 2304.11062 • Published • 3
- DeepSpeed Ulysses: System Optimizations for Enabling Training of Extreme Long Sequence Transformer Models
  Paper • 2309.14509 • Published • 18

- Contrastive Decoding Improves Reasoning in Large Language Models
  Paper • 2309.09117 • Published • 39
- LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
  Paper • 2309.12307 • Published • 88
- Llama 2: Open Foundation and Fine-Tuned Chat Models
  Paper • 2307.09288 • Published • 244
- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25

- Language Modeling Is Compression
  Paper • 2309.10668 • Published • 83
- Baichuan 2: Open Large-scale Language Models
  Paper • 2309.10305 • Published • 20
- Chain-of-Verification Reduces Hallucination in Large Language Models
  Paper • 2309.11495 • Published • 38
- LMDX: Language Model-based Document Information Extraction and Localization
  Paper • 2309.10952 • Published • 65

- Efficient Memory Management for Large Language Model Serving with PagedAttention
  Paper • 2309.06180 • Published • 25
- Ambiguity-Aware In-Context Learning with Large Language Models
  Paper • 2309.07900 • Published • 5
- Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers
  Paper • 2309.08532 • Published • 53
- LASER: LLM Agent with State-Space Exploration for Web Navigation
  Paper • 2309.08172 • Published • 13