lora-dpo-finetuned-stage4-sft-0.5-1e-6_ep-1 / model-00003-of-00005.safetensors

Commit History

Upload LlamaForCausalLM
0f5d2af
verified

S4nto commited on