amirabdullah19852020/interpreting_reward_models

1 contributor

History: 51 commits

amirabdullah19852020

Delete data/merged_contrastive_gpt_neo_125m_from_model_rlhf__on_task_hh_rlhf_activations_dataset.hf

83b8cc7 verified about 1 month ago