alexshengzhili/llama3.1-8b-lora_dpo_0907_preference_iclr2023 Text Generation • Updated 27 days ago • 17
alexshengzhili/phi3-dpo_0908_preference_4_conference_shuffled_2023 Text Generation • Updated 25 days ago • 14