andreaskoepf
/

oasst-rm-2-pythia-1.4b-10000

gpt_neox_reward_model

Inference Endpoints

Model card Files Files and versions Community

andreaskoepf commited on Apr 7, 2023

Commit

d52b99a

•

1 Parent(s): 442d8d5

Update README.md

Files changed (1) hide show

README.md +16 -0

README.md CHANGED Viewed

@@ -1,6 +1,22 @@
 ---
 license: apache-2.0
 ---
 wandb: https://wandb.ai/open-assistant/reward-model/runs/hdp2gnko
 checkpoint-10000

 ---
 license: apache-2.0
 ---
+How to use:
+```python
+# install open assistant model_training module (e.g. run `pip install -e .` in `model/` directory of open-assistant repository)
+import model_training.models.reward_model  # noqa: F401 (registers reward model for AutoModel loading)
+tokenizer = AutoTokenizer.from_pretrained(model_name)
+model = AutoModelForSequenceClassification.from_pretrained(model_name)
+input_text = "<|prompter|>Hi how are you?<|endoftext|><|assistant|>Hi, I am Open-Assistant a large open-source language model trained by LAION AI. How can I help you today?<|endoftext|>"
+inputs = tokenizer(input_text, return_tensors="pt")
+score = rm(**inputs).logits[0].cpu().detach()
+print(score)
+```
 wandb: https://wandb.ai/open-assistant/reward-model/runs/hdp2gnko
 checkpoint-10000