Update README.md

A GPT2-large model trained on the Anthropic/hh-rlhf harmless dataset. It is specifical…
It achieves an accuracy of 0.73698 on the test set, nearly matching models of larger size.
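For a pairwise-preference dataset like hh-rlhf, accuracy of this kind is typically the fraction of test pairs where the model assigns the chosen response a higher reward than the rejected one. A minimal sketch of that computation, with illustrative reward values (in practice each score comes from a forward pass of the reward model):

```
# Illustrative reward scores; real scores come from reward_model(**inputs).logits
chosen_rewards = [1.2, 0.3, -0.5, 2.1]
rejected_rewards = [0.4, 0.7, -1.0, 0.9]

# A pair is "correct" when the chosen response outscores the rejected one
correct = sum(c > r for c, r in zip(chosen_rewards, rejected_rewards))
accuracy = correct / len(chosen_rewards)
print(accuracy)  # 3 of 4 pairs ranked correctly -> 0.75
```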
## Usage:

```
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# rm_tokenizer_path and reward_peft_path1 point to the tokenizer
# and reward-model checkpoints, respectively
rm_tokenizer = AutoTokenizer.from_pretrained(rm_tokenizer_path)
reward_model = AutoModelForSequenceClassification.from_pretrained(
    reward_peft_path1,
)
```
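Once loaded, the reward for a (prompt, response) pair is read from the model's single classification logit. A minimal sketch of that scoring step, using stand-ins for the tokenizer and model so it runs without downloading weights (the function and variable names are assumptions, not part of this repository's API):

```
def get_reward(model_fn, tokenize_fn, prompt, response):
    """Score one (prompt, response) pair with a sequence-classification reward model."""
    text = prompt + response      # real usage would apply the model's chat template
    inputs = tokenize_fn(text)
    logits = model_fn(inputs)     # shape (1, 1): one scalar reward logit per sequence
    return logits[0][0]

# Stand-ins so the sketch is runnable anywhere; with the real model this would be
# rm_tokenizer(text, return_tensors="pt") and reward_model(**inputs).logits
fake_tokenize = lambda text: [len(w) for w in text.split()]
fake_model = lambda ids: [[float(sum(ids))]]

print(get_reward(fake_model, fake_tokenize, "Hello ", "world"))  # 10.0
```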