- Self-Alignment with Instruction Backtranslation
  Paper • 2308.06259 • Published • 38
- ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation
  Paper • 2308.03793 • Published • 9
- From Sparse to Soft Mixtures of Experts
  Paper • 2308.00951 • Published • 19
- Revisiting DETR Pre-training for Object Detection
  Paper • 2308.01300 • Published • 7
Collections including paper arxiv:2309.14525
- Efficient RLHF: Reducing the Memory Usage of PPO
  Paper • 2309.00754 • Published • 13
- Statistical Rejection Sampling Improves Preference Optimization
  Paper • 2309.06657 • Published • 13
- Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation?
  Paper • 2309.07462 • Published • 3
- Stabilizing RLHF through Advantage Model and Selective Rehearsal
  Paper • 2309.10202 • Published • 9

- Large-Scale Automatic Audiobook Creation
  Paper • 2309.03926 • Published • 52
- Agents: An Open-source Framework for Autonomous Language Agents
  Paper • 2309.07870 • Published • 39
- PDFTriage: Question Answering over Long, Structured Documents
  Paper • 2309.08872 • Published • 51
- StarCoder: may the source be with you!
  Paper • 2305.06161 • Published • 28