alexwb/reward_modeling_anthropic_hh_rm0.99

Generated from Trainer


reward_modeling_anthropic_hh_rm0.99 / merges.txt

Commit History

End of training

b7ff958
verified

alexwb committed on Aug 2