ShenaoZhang
/

0.001_idpo_noreplacerej_iter_1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

0.001_idpo_noreplacerej_iter_1

1 contributor

History: 6 commits

ShenaoZhang's picture

End of training

4519c86 verified 4 months ago