PEFT
Safetensors
mixtral
alignment-handbook
trl
dpo
Generated from Trainer
4-bit precision
bitsandbytes
stealth-finance-v2-dpo-adapter / trainer_state.json
jan-hq's picture
Model save
c7f70d3 verified
File too large to display, you can check the raw version instead.