PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer
Qwen2-7B-Instruct-SPPO-Function-call-v2.6 / adapter_model.safetensors

Commit History

Training in progress, step 1091
e303bc6
verified

khongtrunght commited on

Training in progress, step 1000
1bc1667
verified

khongtrunght commited on

Training in progress, step 900
93c94a5
verified

khongtrunght commited on

Training in progress, step 800
879542f
verified

khongtrunght commited on

Training in progress, step 700
b538dc9
verified

khongtrunght commited on

Training in progress, step 600
e91d931
verified

khongtrunght commited on

Training in progress, step 500
587a4a1
verified

khongtrunght commited on

Training in progress, step 400
223bf41
verified

khongtrunght commited on

Training in progress, step 300
f823698
verified

khongtrunght commited on

Training in progress, step 200
8a1137c
verified

khongtrunght commited on

Training in progress, step 100
d6f7852
verified

khongtrunght commited on