Great job! Please consider adding the citation info.

#1
by yuchenlin - opened

Thanks for this great re-implementation of our PairRM, and it is amazing to see that the performance is improved!

Would you please add our paper citation information to the end of the model card page? Thanks!

@inproceedings{llm-blender-2023,
    title = "LLM-Blender: Ensembling Large Language Models with Pairwise Comparison and Generative Fusion",
    author = "Jiang, Dongfu and Ren, Xiang and Lin, Bill Yuchen",
    booktitle = "Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023)",
    year = "2023"
}
yuchenlin changed discussion title from Great job! to Great job! Please consider adding the citation info.
  • This is my main account.

I'm impressed with the approach you've demonstrated in PairRM. Thank you for the amazing model.

I'll add it right away! If you have any questions or anything you'd like to discuss, don't hesitate to contact me.

Have a nice day.

mightbe changed discussion status to closed
