trl-lib
/

llama-7b-se-rl-peft

Model card Files Files and versions Community

kashif HF staff commited on Apr 5, 2023

Commit

de16483

•

1 Parent(s): 7072c92

Update README.md

Files changed (1) hide show

README.md +2 -2

README.md CHANGED Viewed

@@ -9,7 +9,7 @@ tags:
 ---
 # Llama-se-rl-peft
-Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashiv Rasul, Lewis Tunstall and Leandro von Werra.
 For more info check out the [blog post]() and [github example]().
@@ -33,7 +33,7 @@ The **Llama-se-rl** model inherits limitations and biases from the Llama model a
 ```bibtex
 @misc{beeching2023llama,
   title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
-  author={Beeching, Edward and Belkada, Younes and Rasul, Kashiv and Tunstall, Lewis and von Werra, Leandro},
   year={2023}
 }
 ```

 ---
 # Llama-se-rl-peft
+Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
 For more info check out the [blog post]() and [github example]().
 ```bibtex
 @misc{beeching2023llama,
   title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
+  author={Beeching, Edward and Belkada, Younes and Rasul, Kashif and Tunstall, Lewis and von Werra, Leandro},
   year={2023}
 }
 ```