FuseChat 3.0 Collection Preference Optimization for Implicit Model Fusion • 11 items • Updated 15 days ago • 7
Weighted-Reward Preference Optimization for Implicit Model Fusion Paper • 2412.03187 • Published 23 days ago • 9