parth-ptl-97
/

reinforcement-learning-human-feedback

Update config.json

d0af4d0 verified 5 months ago

No virus

130 Bytes

	{
	"reward_model": {
	"model_path": "parth-ptl-97/reinforcement-learning-human-feedback"
	},
	"model_type":"gpt2"
	}