princeton-nlp
committed on
Commit f646c99
1 Parent(s): cb78d4b
Update README.md
README.md
CHANGED
@@ -64,6 +64,7 @@ We used the following hyperparameters:
 - learning rate: 5e-7
 - batch size: 128
 - beta: 0.01
+
 The other hyperparameters are kept the same with our [SimPO recipe](https://github.com/princeton-nlp/SimPO/blob/main/training_configs/gemma-2-9b-it-simpo.yaml).
 
 #### Speeds, Sizes, Times
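
The hunk context says three values differ from the linked SimPO recipe while everything else is kept the same. A minimal sketch of that "override a base config" pattern is below. The key names (`learning_rate`, `batch_size`, `beta`) and the placeholder base values are assumptions for illustration and are not taken from the recipe YAML; only the three override values come from the README.

```python
# Values stated in the README hunk; the key names are hypothetical.
README_OVERRIDES = {
    "learning_rate": 5e-7,
    "batch_size": 128,
    "beta": 0.01,
}


def apply_overrides(base: dict, overrides: dict) -> dict:
    """Return a copy of `base` with `overrides` applied; `base` is not mutated."""
    merged = dict(base)
    merged.update(overrides)
    return merged


# Placeholder base config standing in for the recipe file (values are made up).
placeholder_base = {
    "learning_rate": 0.0,
    "batch_size": 0,
    "beta": 0.0,
    "some_other_setting": "unchanged",
}

config = apply_overrides(placeholder_base, README_OVERRIDES)
```

Settings that are not overridden (here the hypothetical `some_other_setting`) pass through unchanged, which mirrors the README's statement that the other hyperparameters stay the same.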