pythia-2.8b-deduped-tldr-rm / model-00001-of-00002.safetensors

Commit History

Add cleanrl/EleutherAI_pythia-2.8b-deduped__reward__tldr-main checkpoint
ab72981
verified

edbeeching HF staff commited on