parth-ptl-97's picture
Update config.json
d0af4d0 verified
{
"reward_model": {
"model_path": "parth-ptl-97/reinforcement-learning-human-feedback"
},
"model_type":"gpt2"
}