HAJIWEE commited on
Commit
3c33eba
1 Parent(s): 7e447b7

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -14
README.md CHANGED
@@ -21,7 +21,7 @@ model-index:
21
  metrics:
22
  - name: Bleu
23
  type: bleu
24
- value: 46.9185
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,11 +29,11 @@ should probably proofread and complete it, then remove this comment. -->
29
 
30
  # en2zh_opus_100
31
 
32
- This model is a fine-tuned version of [HAJIWEE/en2zh_opus_100](https://huggingface.co/HAJIWEE/en2zh_opus_100) on the opus100 dataset.
33
  It achieves the following results on the evaluation set:
34
- - Loss: 0.5395
35
- - Bleu: 46.9185
36
- - Gen Len: 7.686
37
 
38
  ## Model description
39
 
@@ -52,26 +52,25 @@ More information needed
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
- - learning_rate: 0.0004
56
- - train_batch_size: 16
57
- - eval_batch_size: 16
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
  - num_epochs: 2
62
- - mixed_precision_training: Native AMP
63
 
64
  ### Training results
65
 
66
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
67
  |:-------------:|:-----:|:------:|:---------------:|:-------:|:-------:|
68
- | 0.5549 | 1.0 | 62500 | 0.5596 | 46.7419 | 7.7295 |
69
- | 0.5389 | 2.0 | 125000 | 0.5395 | 46.9185 | 7.686 |
70
 
71
 
72
  ### Framework versions
73
 
74
- - Transformers 4.27.1
75
- - Pytorch 1.13.1+cu116
76
- - Datasets 2.10.1
77
  - Tokenizers 0.13.2
 
21
  metrics:
22
  - name: Bleu
23
  type: bleu
24
+ value: 32.1971
25
  ---
26
 
27
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
29
 
30
  # en2zh_opus_100
31
 
32
+ This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on the opus100 dataset.
33
  It achieves the following results on the evaluation set:
34
+ - Loss: 1.0868
35
+ - Bleu: 32.1971
36
+ - Gen Len: 7.209
37
 
38
  ## Model description
39
 
 
52
  ### Training hyperparameters
53
 
54
  The following hyperparameters were used during training:
55
+ - learning_rate: 2e-05
56
+ - train_batch_size: 6
57
+ - eval_batch_size: 6
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
  - num_epochs: 2
 
62
 
63
  ### Training results
64
 
65
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
66
  |:-------------:|:-----:|:------:|:---------------:|:-------:|:-------:|
67
+ | 1.2742 | 1.0 | 166667 | 1.2032 | 28.4915 | 7.303 |
68
+ | 1.0931 | 2.0 | 333334 | 1.0868 | 32.1971 | 7.209 |
69
 
70
 
71
  ### Framework versions
72
 
73
+ - Transformers 4.27.4
74
+ - Pytorch 2.0.0
75
+ - Datasets 2.11.0
76
  - Tokenizers 0.13.2