Transformers
Safetensors
English
deberta-v2
reward_model
reward-model
RLHF
evaluation
llm
instruction
reranking
Inference Endpoints
Files changed (1) hide show
  1. README.md +11 -1
README.md CHANGED
@@ -220,4 +220,14 @@ def tokenize_conv_pair(convAs: List[str], convBs: List[str]):
220
 
221
  ## Contact
222
 
223
- - [Discord Server Link](https://discord.gg/MrBt3PXdXc)
 
 
 
 
 
 
 
 
 
 
 
220
 
221
  ## Contact
222
 
223
+ - [Discord Server Link](https://discord.gg/MrBt3PXdXc)
224
+
225
+ ## Original Paper
226
+ ```
227
+ @inproceedings{llm-blender-2023,
228
+ title = "LLM-Blender: Ensembling Large Language Models with Pairwise Comparison and Generative Fusion",
229
+ author = "Jiang, Dongfu and Ren, Xiang and Lin, Bill Yuchen",
230
+ booktitle = "Proceedings of the 61th Annual Meeting of the Association for Computational Linguistics (ACL 2023)",
231
+ year = "2023"
232
+ }
233
+ ```