M4-ai
/

TinyMistral-248M-v2-cleaner

Text Classification

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

TinyMistral-248M-v2-cleaner / README.md

Locutusque's picture

Update README.md

127323e 7 months ago

|

history blame contribute delete

No virus

533 Bytes

	---
	license: apache-2.0
	datasets:
	- berkeley-nest/Nectar
	language:
	- en
	---
	# Use cases
	This model is used to deep clean the Rhino dataset, making it a higher quality dataset. This model achieved an average MSE loss of 0.095 during training.
	We recommend to use the sigmoid function to turn the logits into probabilities:
	```python
	1 / (1 + torch.exp(logits))
	```
	# Training
	Using trl's RewardTrainer, this model was trained on berkeley-nest/Nectar. The dataset is curated on-the-fly during training, as explained in the Rhino repo.