mzhaoshuai commited on
Commit
215bbbb
·
verified ·
1 Parent(s): 5323472

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -6,11 +6,11 @@ library_name: transformers
6
  pipeline_tag: text-generation
7
  ---
8
 
9
- # RefAlign: RL with Similarity-based Rewards :sparkles:
10
 
11
- The official implementation of [Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data](https://huggingface.co/papers/2504.09895).
12
 
13
- Code: https://github.com/mzhaoshuai/RefAlign
14
 
15
  ## Introduction
16
 
 
6
  pipeline_tag: text-generation
7
  ---
8
 
9
+ # RefAlign: RL with Similarity-based Rewards
10
 
11
+ **GitHub repository**: https://github.com/mzhaoshuai/RefAlign
12
 
13
+ **Paper**: [Learning from Reference Answers: Versatile Language Model Alignment without Binary Human Preference Data](https://huggingface.co/papers/2504.09895).
14
 
15
  ## Introduction
16