llm-blender
/

PairRM

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Dongfu Jiang commited on Nov 11, 2023

Commit

8d9ead8

•

1 Parent(s): 00b7e60

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -80,7 +80,8 @@ We test the pairwise comparison on
 |           PairRM          |   **84.75**   |   84.48   | **80.33** | **90.7** |  **84.62**  |         **59**        |
 |        GPT -4-0613        |     91.53     |    93.1   |   85.25   |   83.72  |    88.69    |         63.87         |
-While PairRM is a extremely small model (0.4B) based on deberta, the pairwise comparison aggrement performance approches GPT-4's performance!
 Two reasons to attribute:
 - Our PairRM specically designed model arch for pairwise comparison through bidirectional attention (See paper for more details)
 - The high-quality and large-scale human preference annotation data it was train on (see tags for list)

 |           PairRM          |   **84.75**   |   84.48   | **80.33** | **90.7** |  **84.62**  |         **59**        |
 |        GPT -4-0613        |     91.53     |    93.1   |   85.25   |   83.72  |    88.69    |         63.87         |
+**While PairRM is a extremely small model (0.4B) based on deberta, the pairwise comparison aggrement performance approches GPT-4's performance!**
 Two reasons to attribute:
 - Our PairRM specically designed model arch for pairwise comparison through bidirectional attention (See paper for more details)
 - The high-quality and large-scale human preference annotation data it was train on (see tags for list)