evan-nexusflow commited on
Commit
358287c
·
verified ·
1 Parent(s): 5b1310f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md CHANGED
@@ -1,3 +1,25 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ### Usage
2
 
3
  ```python
@@ -91,4 +113,16 @@ messages = [
91
 
92
  print(pipe([messages])) # Print the reward!
93
 
 
 
 
 
 
 
 
 
 
 
 
 
94
  ```
 
1
+ ---
2
+ license: other
3
+ language:
4
+ - en
5
+ library_name: transformers
6
+ tags:
7
+ - RLHF
8
+ - Nexusflow
9
+ - Athene
10
+ - Reward Model
11
+ ---
12
+
13
+ # Llama3-Athene-RM-8B
14
+
15
+ We introduce Llama3-Athene-RM-8B, an open-weights reward model based off Llama-3-8B-Instruct.
16
+
17
+ - **Developed by:** The Nexusflow Team (Evan Frick\*, Peter Jin\*, Tianle Li\*, Karthik Ganesan, Jian Zhang, Jiantao Jiao and Banghua Zhu).
18
+ - **Model type:** Reward Model
19
+ - **Finetuned from model:** [Llama-3-8B-Instruct](https://huggingface.co/meta-llama/Meta-Llama-3-8B-Instruct).
20
+ - **License**: [Nexusflow Research License](https://huggingface.co/Nexusflow/Athene-70B/blob/main/Nexusflow_Research_License.pdf)
21
+ - **Blog**: https://nexusflow.ai/blogs/athene
22
+ -
23
  ### Usage
24
 
25
  ```python
 
113
 
114
  print(pipe([messages])) # Print the reward!
115
 
116
+ ```
117
+
118
+ ### Citation
119
+
120
+ ```
121
+ @misc{Athene2024,
122
+ title = {Athene-70B: Redefining the Boundaries of Post-Training for Open Models},
123
+ url = {https://nexusflow.ai/blogs/athene},
124
+ author = {Frick, Evan and Jin, Peter and Li, Tianle and Ganesan, Karthik and Zhang, Jian and Jiao, Jiantao and Zhu, Banghua},
125
+ month = {July},
126
+ year = {2024}
127
+ }
128
  ```