mzhaoshuai
/

Llama-2-7b-hf-conf-refalign

Text Generation

text-generation-inference

Model card Files Files and versions

mzhaoshuai commited on about 1 month ago

Commit

215bbbb

·

verified ·

1 Parent(s): 5323472

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -6,11 +6,11 @@ library_name: transformers
 pipeline_tag: text-generation
 ---
-# RefAlign: RL with Similarity-based Rewards :sparkles:
-The official implementation of [Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data](https://huggingface.co/papers/2504.09895).
-Code: https://github.com/mzhaoshuai/RefAlign
 ## Introduction

 pipeline_tag: text-generation
 ---
+# RefAlign: RL with Similarity-based Rewards
+**GitHub repository**: https://github.com/mzhaoshuai/RefAlign
+**Paper**: [Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data](https://huggingface.co/papers/2504.09895).
 ## Introduction