Investigating Efficiently Extending Transformers for Long Input Summarization Paper • 2208.04347 • Published Aug 8, 2022
LiPO: Listwise Preference Optimization through Learning-to-Rank Paper • 2402.01878 • Published Feb 2, 2024 • 19
Self-Evaluation Improves Selective Generation in Large Language Models Paper • 2312.09300 • Published Dec 14, 2023 • 14
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models Paper • 2312.06585 • Published Dec 11, 2023 • 28
Frontier Language Models are not Robust to Adversarial Arithmetic, or "What do I need to say so you agree 2+2=5?" Paper • 2311.07587 • Published Nov 8, 2023 • 3
Calibrating Sequence likelihood Improves Conditional Language Generation Paper • 2210.00045 • Published Sep 30, 2022 • 1
Improving Large Language Model Fine-tuning for Solving Math Problems Paper • 2310.10047 • Published Oct 16, 2023 • 5
Small-scale proxies for large-scale Transformer training instabilities Paper • 2309.14322 • Published Sep 25, 2023 • 19
Statistical Rejection Sampling Improves Preference Optimization Paper • 2309.06657 • Published Sep 13, 2023 • 13
SLiC-HF: Sequence Likelihood Calibration with Human Feedback Paper • 2305.10425 • Published May 17, 2023 • 5
PEGASUS: Pre-training with Extracted Gap-sentences for Abstractive Summarization Paper • 1912.08777 • Published Dec 18, 2019 • 2
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Paper • 1910.10683 • Published Oct 23, 2019 • 9