M4-ai/TinyMistral-248M-v2-cleaner

Text Classification

Locutusque committed on Jan 8

Commit 127323e • 1 Parent(s): f4091f4

Update README.md

Files changed (1)

README.md +5 -1

@@ -7,5 +7,9 @@ language:
 ---
 # Use cases
 This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
 # Training
-Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.

 ---
 # Use cases
 This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
+We recommend using the sigmoid function to turn the logits into probabilities:
+```python
+1 / (1 + torch.exp(-logits))
+```
 # Training
+Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.
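
For reference, the standard sigmoid is `1 / (1 + exp(-x))`, with a negative sign in the exponent, so that larger logits map to probabilities closer to 1. The sketch below uses only the Python standard library to illustrate the mapping. The `sigmoid` helper and the example logits are hypothetical and are not taken from the model or its README. When working with tensors, PyTorch's built-in `torch.sigmoid(logits)` computes the same function.

```python
import math


def sigmoid(logit: float) -> float:
    """Map a raw logit to a probability in [0, 1] via 1 / (1 + exp(-logit))."""
    if logit >= 0:
        return 1.0 / (1.0 + math.exp(-logit))
    # For negative inputs, use the algebraically equivalent form
    # exp(x) / (1 + exp(x)) so that exp() never overflows.
    z = math.exp(logit)
    return z / (1.0 + z)


# Hypothetical logits, chosen only for illustration (not real model outputs).
example_logits = [-2.0, 0.0, 2.0]
probabilities = [sigmoid(x) for x in example_logits]
print(probabilities)
```

A logit of 0 maps to exactly 0.5, and `sigmoid(x) + sigmoid(-x)` equals 1, which is a quick way to check that the sign in the exponent is right.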