2 Dr. LLaMA: Improving Small Language Models in Domain-Specific QA via Generative Data Augmentation · 4 authors 1
1 RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs · 7 authors