GREEN-RadLlama2-7b / trainer_state.json

Commit History

StanfordAIMI/RewardRadLLaMA-7b
6eddb21
verified

zhjohnchan committed on