GENIAC-Team-Ozaki/lora-dpo-finetuned-stage4-full-sft-v3-0.5_5e-7_ep-15 • Text Generation • Updated May 25 • 2
wentingzhao/Meta-Llama-3-8B-Instruct-model-a_iteration0-lr2e-6-bs8 • Text Generation • Updated May 25 • 2
wentingzhao/Meta-Llama-3-8B-Instruct-model-b_iteration0-lr2e-6-bs8 • Text Generation • Updated May 25 • 1
GENIAC-Team-Ozaki/lora-dpo-finetuned-stage4-full-sft-v3-0.5_5e-7_ep-5 • Text Generation • Updated May 25 • 2
GENIAC-Team-Ozaki/lora-dpo-finetuned-stage4-full-sft-v3-0.5_5e-7_ep-10 • Text Generation • Updated May 25 • 2