trl-lib/llama-7b-se-rl-peft

edbeeching (HF staff) committed on Apr 5, 2023

Commit 7072c92 • 1 Parent(s): 0452f71

Update README.md

Files changed (1)

README.md +2 -1

@@ -8,8 +8,9 @@ tags:
 - reinforcement-learning
 ---
-# Llama-se-rl-adapter
+# Llama-se-rl-peft
 Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashiv Rasul, Lewis Tunstall and Leandro von Werra.
+For more info check out the [blog post]() and [github example]().
 ## Model Description