
A reward model fine-tuned from gemma-2b-it following the recipe of RLHF-Reward-Modeling.
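As a rough illustration of what a reward model's scores are used for, the sketch below turns two scalar rewards into a preference probability and a pairwise training loss. It assumes the Bradley-Terry formulation, which is one of the variants commonly used for reward modeling; this card does not state which variant was used for this model, and the function names are hypothetical.

```python
import math


def preference_probability(reward_chosen: float, reward_rejected: float) -> float:
    """Bradley-Terry probability that the first response is preferred.

    Assumption: rewards are unbounded scalars, one per response.
    """
    return 1.0 / (1.0 + math.exp(-(reward_chosen - reward_rejected)))


def pairwise_loss(reward_chosen: float, reward_rejected: float) -> float:
    """Negative log-likelihood that the chosen response beats the rejected one."""
    return -math.log(preference_probability(reward_chosen, reward_rejected))


if __name__ == "__main__":
    # Hypothetical scores for two responses to the same prompt.
    print(preference_probability(1.5, -0.5))
    print(pairwise_loss(1.5, -0.5))
```

Only the difference between the two rewards matters under this formulation, so scores are meaningful for comparing responses to the same prompt rather than as absolute values.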

Format: Safetensors
Model size: 2.51B params
Tensor type: F32