chanchan7
/

llama-7b-dpo-qlora

alignment-handbook

Generated from Trainer

4-bit precision

Model card Files Files and versions Metrics Training metrics Community

llama-7b-dpo-qlora / runs /Mar05_01-38-10_SYS-4029GP-TRT

1 contributor

History: 18 commits

chanchan7's picture

Model save

aef4b53 verified 7 months ago

events.out.tfevents.1709574193.SYS-4029GP-TRT.1942121.0

141 kB
LFS

Model save 7 months ago
events.out.tfevents.1709602315.SYS-4029GP-TRT.1942121.1

828 Bytes
LFS

Model save 7 months ago