Commit 7072c92
1 Parent(s): 0452f71
Update README.md
README.md
CHANGED
@@ -8,8 +8,9 @@ tags:
 - reinforcement-learning
 ---
 
-# Llama-se-rl-
+# Llama-se-rl-peft
 Adapter weights of an RL fine-tuned model based on LLaMa. Authored by Edward Beeching, Younes Belkada, Kashiv Rasul, Lewis Tunstall and Leandro von Werra.
+For more info check out the [blog post]() and [github example]().
 
 
 ## Model Description