Transformers
PyTorch
English
trl
rlhf
kashif HF staff commited on
Commit
de16483
1 Parent(s): 7072c92

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -9,7 +9,7 @@ tags:
9
  ---
10
 
11
  # Llama-se-rl-peft
12
- Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashiv Rasul, Lewis Tunstall and Leandro von Werra.
13
  For more info check out the [blog post]() and [github example]().
14
 
15
 
@@ -33,7 +33,7 @@ The **Llama-se-rl** model inherits limitations and biases from the Llama model a
33
  ```bibtex
34
  @misc{beeching2023llama,
35
  title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
36
- author={Beeching, Edward and Belkada, Younes and Rasul, Kashiv and Tunstall, Lewis and von Werra, Leandro},
37
  year={2023}
38
  }
39
  ```
 
9
  ---
10
 
11
  # Llama-se-rl-peft
12
+ Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashif Rasul, Lewis Tunstall and Leandro von Werra.
13
  For more info check out the [blog post]() and [github example]().
14
 
15
 
 
33
  ```bibtex
34
  @misc{beeching2023llama,
35
  title={StackLLaMa: An RL Fine-tuned LLaMa Model for Stack Exchange Question and Answering},
36
+ author={Beeching, Edward and Belkada, Younes and Rasul, Kashif and Tunstall, Lewis and von Werra, Leandro},
37
  year={2023}
38
  }
39
  ```