lora-dpo-finetuned-stage4-full-sft-v3-0.5_5e-7_ep-10 / model-00005-of-00005.safetensors

Commit History

Upload LlamaForCausalLM
6e63875
verified

S4nto commited on