princeton-nlp's Collections

SimPO

This collection contains the models trained and evaluated in the preprint: SimPO: Simple Preference Optimization with a Reference-Free R