hendrydong commited on
Commit
e5e7aaf
1 Parent(s): 739cb2d

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +10 -0
README.md CHANGED
@@ -1,3 +1,13 @@
 
 
 
 
 
 
 
 
 
 
1
  To use this model, you need to load by `AutoModelForSequenceClassification`,
2
  ```python
3
  model = AutoModelForSequenceClassification.from_pretrained(
 
1
+ # Training
2
+
3
+ The base model is `mistralai/Mistral-7B-Instruct-v0.2`.
4
+
5
+ We also merge the training script at https://github.com/WeiXiongUST/RLHF-Reward-Modeling.
6
+
7
+ Thanks Wei (https://huggingface.co/weqweasdas) for his help and contribution to the community.
8
+
9
+ # Usage
10
+
11
  To use this model, you need to load by `AutoModelForSequenceClassification`,
12
  ```python
13
  model = AutoModelForSequenceClassification.from_pretrained(