alexwb
/

reward_modeling_anthropic_hh_rm0.9_lr5e-5

Generated from Trainer

Model card Files Files and versions Community

reward_modeling_anthropic_hh_rm0.9_lr5e-5 / tokenizer.json

alexwb's picture

End of training

718a024 verified 5 months ago

2.11 MB

File too large to display, you can check the raw version instead.