cmarkea
/

distilcamembert-base

Inference Endpoints

Model card Files Files and versions Community

Cyrile commited on Jan 13, 2022

Commit

3141ee4

·

1 Parent(s): df1a58e

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -22,7 +22,7 @@ The training for the distilled model (student model) is designed to be the close
 The final loss function is a combination of these three losses functions. We use the following ponderation:
-*Loss = 0.5 DistilLoss + 0.2 MLMLoss + 0.3 CosineLoss*
 Dataset
 -------

 The final loss function is a combination of these three losses functions. We use the following ponderation:
+$$Loss = 0.5 \times DistilLoss + 0.2 \times MLMLoss + 0.3 \times CosineLoss$$
 Dataset
 -------