lora-dpo-finetuned-stage4-full-sft-v3-0.1_5e-7_ep-1 / model-00001-of-00005.safetensors

Commit History

Upload LlamaForCausalLM
773ad26

S4nto committed on