weqweasdas
/

RM-Gemma-7B

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

weqweasdas commited on Mar 22, 2024

Commit

3f8102a

·

verified ·

1 Parent(s): fc1057c

Update README.md

Files changed (1) hide show

README.md +1 -0

README.md CHANGED Viewed

@@ -8,6 +8,7 @@
 The reward model is trained from the base model [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).
 ## Model Details

 The reward model is trained from the base model [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).
+The training script is available at https://github.com/WeiXiongUST/RLHF-Reward-Modeling .
 ## Model Details