Commit
•
127323e
1
Parent(s):
f4091f4
Update README.md
Browse files
README.md
CHANGED
@@ -7,5 +7,9 @@ language:
|
|
7 |
---
|
8 |
# Use cases
|
9 |
This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
|
|
|
|
|
|
|
|
|
10 |
# Training
|
11 |
-
Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.
|
|
|
7 |
---
|
8 |
# Use cases
|
9 |
This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
|
10 |
+
We recommend to use the sigmoid function to turn the logits into probabilities:
|
11 |
+
```python
|
12 |
+
1 / (1 + torch.exp(logits))
|
13 |
+
```
|
14 |
# Training
|
15 |
+
Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.
|