weqweasdas committed on
Commit
3f8102a
1 Parent(s): fc1057c

Update README.md

  1. README.md +1 -0
README.md CHANGED
@@ -8,6 +8,7 @@
 
 The reward model is trained from the base model [google/gemma-7b-it](https://huggingface.co/google/gemma-7b-it).
 
+The training script is available at https://github.com/WeiXiongUST/RLHF-Reward-Modeling .
 
 ## Model Details
 