lora-dpo-finetuned-stage4-full-sft-v3-0.1_5e-7_ep-15 / model-00004-of-00005.safetensors

Commit History

Upload LlamaForCausalLM
0f47fac
verified

S4nto commited on