cmarkea
/

distilcamembert-base

Inference Endpoints

Model card Files Files and versions Community

Cyrile commited on Feb 7, 2022

Commit

a26eab8

•

1 Parent(s): f140f84

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ The training for the distilled model (student model) is designed to be the close
 The final loss function is a combination of these three losses functions. We use the following ponderation:
-$$Loss = 0.5 \times DistilLoss + 0.3 \times CosineLoss$$ + 0.2 \times MLMLoss
 Dataset
 -------

 The final loss function is a combination of these three losses functions. We use the following ponderation:
+$$Loss = 0.5 \times DistilLoss + 0.3 \times CosineLoss + 0.2 \times MLMLoss$$
 Dataset
 -------