alexwb/reward_modeling_anthropic_hh_rm1e-3

Generated from Trainer

reward_modeling_anthropic_hh_rm1e-3 / merges.txt

End of training

57192f9 verified 3 months ago

456 kB

File too large to display; check the raw version instead.