-
Adapting Large Language Models via Reading Comprehension
Paper • 2309.09530 • Published • 69 -
RLAIF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Paper • 2309.00267 • Published • 45 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 38 -
Unnatural Instructions: Tuning Language Models with (Almost) No Human Labor
Paper • 2212.09689 • Published • 1
Collections
Discover the best community collections!
Collections including paper arxiv:2308.06259
-
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 38 -
ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
Paper • 2308.03793 • Published • 9 -
From Sparse to Soft Mixtures of Experts
Paper • 2308.00951 • Published • 19 -
Revisiting DETR Pre-training for Object Detection
Paper • 2308.01300 • Published • 7
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 72 -
One Wide Feedforward is All You Need
Paper • 2309.01826 • Published • 31 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 38 -
Shepherd: A Critic for Language Model Generation
Paper • 2308.04592 • Published • 27