lora-dpo-finetuned-stage4-full-sft-v3-0.5_5e-7_ep-1 / model-00005-of-00005.safetensors

Commit History

Upload LlamaForCausalLM
52aadcb
verified

S4nto commited on