guocheng98
commited on
Commit
•
7b6ccdc
1
Parent(s):
2e34278
Update README.md
Browse files
README.md
CHANGED
@@ -17,7 +17,7 @@ It achieves the following results on the evaluation set:
|
|
17 |
|
18 |
## Model description
|
19 |
|
20 |
-
Transformer-based NMT model to translate from Spanish to Simplified Chinese.
|
21 |
|
22 |
## Intended uses & limitations
|
23 |
|
@@ -45,6 +45,8 @@ The following hyperparameters were used during training:
|
|
45 |
- lr_scheduler_warmup_steps: 2000
|
46 |
- num_epochs: 10
|
47 |
- mixed_precision_training: Native AMP
|
|
|
|
|
48 |
|
49 |
### Training results
|
50 |
|
|
|
17 |
|
18 |
## Model description
|
19 |
|
20 |
+
Transformer-based NMT model to translate from Spanish to Simplified Chinese, fine-tuned for legal domain.
|
21 |
|
22 |
## Intended uses & limitations
|
23 |
|
|
|
45 |
- lr_scheduler_warmup_steps: 2000
|
46 |
- num_epochs: 10
|
47 |
- mixed_precision_training: Native AMP
|
48 |
+
- weight_decay: 0.01
|
49 |
+
- early_stopping_patience: 8
|
50 |
|
51 |
### Training results
|
52 |
|