PEFT
Safetensors
qwen2
alignment-handbook
trl
dpo
Generated from Trainer

This model has 1 file scanned as suspicious.

Latest commit by khongtrunght: Model save (844f260, verified)