llm-blender/PairRM

Dongfu Jiang committed on Nov 11, 2023

Commit edac579, 1 parent: 8d9ead8

Update README.md

Files changed (1)

README.md +2 -2

@@ -83,8 +83,8 @@ We test the pairwise comparison on
 **While PairRM is an extremely small model (0.4B) based on DeBERTa, its pairwise comparison agreement approaches GPT-4's performance!**
 Two reasons account for this:
-- PairRM's model architecture is specifically designed for pairwise comparison through bidirectional attention (see the paper for more details)
-- The high-quality, large-scale human preference annotation data it was trained on (see tags for the list)
+- PairRM's model architecture is specifically designed for pairwise comparison through bidirectional attention (see the LLM-Blender paper for more details)
+- The high-quality, large-scale human preference annotation data it was trained on (see the training dataset list on this Hugging Face page)
 ## Usage Example