Nexusflow
/

Athene-RM-8B

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

evan-nexusflow commited on Oct 21, 2024

Commit

358287c

·

verified ·

1 Parent(s): 5b1310f

Update README.md

Files changed (1) hide show

README.md +34 -0

README.md CHANGED Viewed

@@ -1,3 +1,25 @@
 ### Usage
 ```python
@@ -91,4 +113,16 @@ messages = [
 print(pipe([messages])) # Print the reward!
 ```

+---
+license: other
+language:
+- en
+library_name: transformers
+tags:
+- RLHF
+- Nexusflow
+- Athene
+- Reward Model
+---
+# Llama3-Athene-RM-8B
+We introduce Llama3-Athene-RM-8B, an open-weights reward model based off Llama-3-8B-Instruct.
+- **Developed by:** The Nexusflow Team (Evan Frick\*, Peter Jin\*, Tianle Li\*, Karthik Ganesan, Jian Zhang, Jiantao Jiao and Banghua Zhu).
+- **Model type:** Reward Model
+- **Finetuned from model:** [Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
+- **License**: [Nexusflow Research License](https://huggingface.co/Nexusflow/Athene-70B/blob/main/Nexusflow_Research_License.pdf)
+- **Blog**: https://nexusflow.ai/blogs/athene
+-
 ### Usage
 ```python
 print(pipe([messages])) # Print the reward!
+```
+### Citation
+```
+@misc{Athene2024,
+    title = {Athene-70B: Redefining the Boundaries of Post-Training for Open Models},
+    url = {https://nexusflow.ai/blogs/athene},
+    author = {Frick, Evan and Jin, Peter and Li, Tianle and Ganesan, Karthik and Zhang, Jian and Jiao, Jiantao and Zhu, Banghua},
+    month = {July},
+    year = {2024}
+}
 ```