opt-350m-hh-rlhf/reward_model/adapter_config.json

Commit History