zfz1
/

deepseek-8b-orpo-lora

alignment-handbook

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

deepseek-8b-orpo-lora / runs /Jul18_00-59-35_phe108-jieyuzhao-01

1 contributor

History: 2 commits

zfz1's picture

End of training

dd36c6b verified 4 months ago

events.out.tfevents.1721289833.phe108-jieyuzhao-01.335789.0

32.2 kB
LFS

Training in progress, step 312 4 months ago
events.out.tfevents.1721295716.phe108-jieyuzhao-01.335789.1

997 Bytes
LFS

End of training 4 months ago