Text Classification
Transformers
PyTorch
English
deberta-v2
reward-model
reward_model
RLHF
Inference Endpoints
theblackcat102 commited on
Commit
d19d4ef
1 Parent(s): 24b78a2

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -0
README.md CHANGED
@@ -32,6 +32,17 @@ All models are train on these dataset with a same split seed across datasets (if
32
 
33
  - [synthetic-instruct-gptj-pairwise](https://huggingface.co/datasets/Dahoas/synthetic-instruct-gptj-pairwise)
34
 
 
 
 
 
 
 
 
 
 
 
 
35
 
36
  # Performance
37
 
 
32
 
33
  - [synthetic-instruct-gptj-pairwise](https://huggingface.co/datasets/Dahoas/synthetic-instruct-gptj-pairwise)
34
 
35
+ # How to use
36
+
37
+ ```
38
+ from transformer import AutoModelForSequenceClassification, AutoTokenizer
39
+ reward_name = "OpenAssistant/reward-model-deberta-v3-large"
40
+ rank_model, tokenizer = AutoModelForSequenceClassification.from_pretrained(reward_name), AutoTokenizer.from_pretrained(reward_name)
41
+ question, answer = "Explain nuclear fusion like I am five", "Nuclear fusion is the process by which two or more protons and neutrons combine to form a single nucleus. It is a very important process in the universe, as it is the source of energy for stars and galaxies. Nuclear fusion is also a key process in the production of energy for nuclear power plants."
42
+ inputs = tokenizer(question, answer, return_tensors='pt')
43
+ score = rank_model(**inputs).logits[0].cpu().detach()
44
+ print(score)
45
+ ```
46
 
47
  # Performance
48