andreaskoepf
commited on
Commit
•
3d8dbaf
1
Parent(s):
bddac42
Update README.md
Browse files
README.md
CHANGED
@@ -3,14 +3,13 @@ license: apache-2.0
|
|
3 |
---
|
4 |
# Pythia 6.9B Based Reward Model
|
5 |
|
6 |
-
-
|
7 |
- wandb: https://wandb.ai/open-assistant/reward-model/runs/5xld9wmd
|
8 |
- checkpoint: 3500 steps
|
9 |
|
10 |
Compute was generously provided by [Stability AI](https://stability.ai/)
|
11 |
|
12 |
|
13 |
-
|
14 |
### How to use
|
15 |
|
16 |
```python
|
|
|
3 |
---
|
4 |
# Pythia 6.9B Based Reward Model
|
5 |
|
6 |
+
- base model: [andreaskoepf/pythia-6.9b-gpt4all-pretrain](https://huggingface.co/andreaskoepf/pythia-6.9b-gpt4all-pretrain)
|
7 |
- wandb: https://wandb.ai/open-assistant/reward-model/runs/5xld9wmd
|
8 |
- checkpoint: 3500 steps
|
9 |
|
10 |
Compute was generously provided by [Stability AI](https://stability.ai/)
|
11 |
|
12 |
|
|
|
13 |
### How to use
|
14 |
|
15 |
```python
|