Llama-3.2-3B-Instruct-DPO-Math / model-00001-of-00002.safetensors

Commit History

(Trained with Unsloth)
eafdb58
verified

arqa39 committed on