Dongfu Jiang
commited on
Commit
•
edac579
1
Parent(s):
8d9ead8
Update README.md
Browse files
README.md
CHANGED
@@ -83,8 +83,8 @@ We test the pairwise comparison on
|
|
83 |
**While PairRM is a extremely small model (0.4B) based on deberta, the pairwise comparison aggrement performance approches GPT-4's performance!**
|
84 |
|
85 |
Two reasons to attribute:
|
86 |
-
- Our PairRM specically designed model arch for pairwise comparison through bidirectional attention (See paper for more details)
|
87 |
-
- The high-quality and large-scale human preference annotation data it was train on (see
|
88 |
|
89 |
|
90 |
## Usage Example
|
|
|
83 |
**While PairRM is a extremely small model (0.4B) based on deberta, the pairwise comparison aggrement performance approches GPT-4's performance!**
|
84 |
|
85 |
Two reasons to attribute:
|
86 |
+
- Our PairRM specically designed model arch for pairwise comparison through bidirectional attention (See LLM-blender paper for more details)
|
87 |
+
- The high-quality and large-scale human preference annotation data it was train on (see training dataset list on this hugging face page)
|
88 |
|
89 |
|
90 |
## Usage Example
|