Aligning Large Language Models via Self-Steering Optimization Paper • 2410.17131 • Published Oct 22, 2024 • 22
A Unified View of Delta Parameter Editing in Post-Trained Large-Scale Models Paper • 2410.13841 • Published Oct 17, 2024 • 16
Towards a Unified View of Preference Learning for Large Language Models: A Survey Paper • 2409.02795 • Published Sep 4, 2024 • 72
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model Paper • 2408.10764 • Published Aug 20, 2024 • 9
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models Paper • 2406.13542 • Published Jun 19, 2024 • 16
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models Paper • 2308.07074 • Published Aug 14, 2023
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition Paper • 2310.05492 • Published Oct 9, 2023 • 2
Query and Response Augmentation Cannot Help Out-of-domain Math Reasoning Generalization Paper • 2310.05506 • Published Oct 9, 2023 • 1
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment Paper • 2405.17931 • Published May 28, 2024
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment Paper • 2401.12474 • Published Jan 23, 2024 • 35