rossanez commited on
Commit
54a4920
1 Parent(s): eeed5a9

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +11 -11
README.md CHANGED
@@ -19,7 +19,7 @@ model-index:
19
  metrics:
20
  - name: Bleu
21
  type: bleu
22
- value: 8.2154
23
  ---
24
 
25
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,9 +29,9 @@ should probably proofread and complete it, then remove this comment. -->
29
 
30
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the wmt14 dataset.
31
  It achieves the following results on the evaluation set:
32
- - Loss: 2.0694
33
- - Bleu: 8.2154
34
- - Gen Len: 17.3996
35
 
36
  ## Model description
37
 
@@ -51,8 +51,8 @@ More information needed
51
 
52
  The following hyperparameters were used during training:
53
  - learning_rate: 2e-05
54
- - train_batch_size: 8
55
- - eval_batch_size: 8
56
  - seed: 42
57
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
58
  - lr_scheduler_type: linear
@@ -63,11 +63,11 @@ The following hyperparameters were used during training:
63
 
64
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
65
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
66
- | No log | 1.0 | 375 | 2.0923 | 8.0197 | 17.4046 |
67
- | 2.3091 | 2.0 | 750 | 2.0806 | 8.0314 | 17.4186 |
68
- | 2.2602 | 3.0 | 1125 | 2.0746 | 8.1423 | 17.4033 |
69
- | 2.2337 | 4.0 | 1500 | 2.0702 | 8.2025 | 17.4029 |
70
- | 2.2337 | 5.0 | 1875 | 2.0694 | 8.2154 | 17.3996 |
71
 
72
 
73
  ### Framework versions
19
  metrics:
20
  - name: Bleu
21
  type: bleu
22
+ value: 5.8913
23
  ---
24
 
25
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
29
 
30
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the wmt14 dataset.
31
  It achieves the following results on the evaluation set:
32
+ - Loss: 2.2040
33
+ - Bleu: 5.8913
34
+ - Gen Len: 17.5408
35
 
36
  ## Model description
37
 
51
 
52
  The following hyperparameters were used during training:
53
  - learning_rate: 2e-05
54
+ - train_batch_size: 16
55
+ - eval_batch_size: 16
56
  - seed: 42
57
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
58
  - lr_scheduler_type: linear
63
 
64
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
65
  |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
66
+ | No log | 1.0 | 188 | 2.3366 | 2.8075 | 17.8188 |
67
+ | No log | 2.0 | 376 | 2.2557 | 4.8765 | 17.626 |
68
+ | 2.6928 | 3.0 | 564 | 2.2246 | 5.5454 | 17.5534 |
69
+ | 2.6928 | 4.0 | 752 | 2.2086 | 5.8511 | 17.5461 |
70
+ | 2.6928 | 5.0 | 940 | 2.2040 | 5.8913 | 17.5408 |
71
 
72
 
73
  ### Framework versions