Text Generation
PEFT
Safetensors
trl
dpo
unsloth
conversational
Mistral-SLERP-Merged7B-DPO / training_args.bin

Commit History

ayoubkirouane/Mistral-SLERP-Merged7B-DPO
4907099
verified

ayoubkirouane committed on