Text Generation
Transformers
Safetensors
llama
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
flydust commited on
Commit
60cf460
·
verified ·
1 Parent(s): 0b30eab

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +7 -0
README.md CHANGED
@@ -275,6 +275,13 @@ If you find the model, data, or code useful, please cite:
275
  archivePrefix={arXiv},
276
  primaryClass={cs.CL}
277
  }
 
 
 
 
 
 
 
278
  ```
279
 
280
  **Contact**
 
275
  archivePrefix={arXiv},
276
  primaryClass={cs.CL}
277
  }
278
+
279
+ @article{xu2024stronger,
280
+ title={Stronger Models are NOT Stronger Teachers for Instruction Tuning},
281
+ author={Xu, Zhangchen and Jiang, Fengqing and Niu, Luyao and Lin, Bill Yuchen and Poovendran, Radha},
282
+ journal={arXiv preprint arXiv:2411.07133},
283
+ year={2024}
284
+ }
285
  ```
286
 
287
  **Contact**