amirabdullah19852020
/

interpreting_reward_models

Model card Files Files and versions Community

interpreting_reward_models / data /merged_contrastive_pythia_160m_hh_rlhf_activations_and_features.hf

Commit History

Upload folder using huggingface_hub

1ce405d
verified

amirabdullah19852020 commited on May 22