Guanzheng commited on
Commit
63f1058
·
verified ·
1 Parent(s): e1b9d75

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -1
README.md CHANGED
@@ -147,4 +147,18 @@ train/train_longpo.py \
147
  | Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
148
  | Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
149
  | Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
150
- | Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
147
  | Mistral-7B-LongPO-256K-EXP | 59.47 | 60.28 | 83.14 | 78.14 | 70.26 |
148
  | Mistral-7B-LongPO-512K-EXP | 59.51 | 60.58 | 82.87 | 77.66 | 70.16 |
149
  | Qwen2.5-7B-Instruct | 74.28 | 67.15 | 81.41 | 74.66 | 74.38 |
150
+ | Qwen2.5-7B-LongPO-128K | 73.64 | 65.70 | 80.82 | 74.98 | 73.79 |
151
+
152
+
153
+ ## Citation
154
+ If you find our project useful, hope you can star our repo and cite our paper as follows:
155
+ ```
156
+ @inproceedings{
157
+ chen2025longpo,
158
+ title={Long{PO}: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization},
159
+ author={Guanzheng Chen and Xin Li and Michael Shieh and Lidong Bing},
160
+ booktitle={The Thirteenth International Conference on Learning Representations},
161
+ year={2025},
162
+ url={https://openreview.net/forum?id=qTrEq31Shm}
163
+ }
164
+ ```