alexwb
/

reward_modeling_anthropic_hh_rm1e-4

Generated from Trainer

Model card Files Files and versions Community

reward_modeling_anthropic_hh_rm1e-4 / vocab.json

Commit History

End of training

8ef1ce8
verified

alexwb commited on Aug 7