Locutusque committed on
Commit
127323e
1 Parent(s): f4091f4

Update README.md

  1. README.md +5 -1
README.md CHANGED
@@ -7,5 +7,9 @@ language:
  ---
  # Use cases
  This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
+ We recommend using the sigmoid function to turn the logits into probabilities:
+ ```python
+ 1 / (1 + torch.exp(-logits))  # equivalent to torch.sigmoid(logits)
+ ```
  # Training
- Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.
+ Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.
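
As a rough illustration of the sigmoid recommendation in this change, the sketch below maps raw logits to probabilities with sigmoid(x) = 1 / (1 + exp(-x)); note the negative sign in the exponent. It uses only the standard library so it runs anywhere. With PyTorch tensors, `torch.sigmoid(logits)` should give the same values. The helper name `sigmoid` and the sample logits are hypothetical and are not outputs of this model.

```python
import math


def sigmoid(logit: float) -> float:
    """Map a raw logit to a probability in (0, 1) without overflowing exp()."""
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    # For negative inputs, use the algebraically equal form exp(x) / (1 + exp(x))
    # so that exp() never receives a large positive argument.
    z = math.exp(logit)
    return z / (1.0 + z)


# Hypothetical logits, for illustration only (not produced by this model).
example_logits = [-2.0, 0.0, 2.0]
probabilities = [sigmoid(x) for x in example_logits]
```

A logit of 0 maps to a probability of 0.5, and larger logits map to probabilities closer to 1, so the ordering of scores is preserved.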