princeton-nlp/Llama-3-Base-8B-SFT-IPO
Text Generation
•
Updated
•
9
This collections contains the list of model being trained and evaluated in the preprint: SimPO: Simple Preference Optimization with a Reference-Free R