rossanez/t5-small-finetuned-de-en-epochs5


rossanez committed on Dec 4, 2021

Commit 54a4920 • 1 Parent(s): eeed5a9

update model card README.md

Files changed (1)

README.md +11 -11


@@ -19,7 +19,7 @@ model-index:
     metrics:
     - name: Bleu
       type: bleu
-      value: 8.2154
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -29,9 +29,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the wmt14 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0694
-- Bleu: 8.2154
-- Gen Len: 17.3996
 ## Model description
@@ -51,8 +51,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
@@ -63,11 +63,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
-| No log        | 1.0   | 375  | 2.0923          | 8.0197 | 17.4046 |
-| 2.3091        | 2.0   | 750  | 2.0806          | 8.0314 | 17.4186 |
-| 2.2602        | 3.0   | 1125 | 2.0746          | 8.1423 | 17.4033 |
-| 2.2337        | 4.0   | 1500 | 2.0702          | 8.2025 | 17.4029 |
-| 2.2337        | 5.0   | 1875 | 2.0694          | 8.2154 | 17.3996 |
 ### Framework versions
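
The Training Loss column in the removed table reads "No log" for epoch 1 and repeats 2.2337 for epochs 4 and 5. One plausible explanation, sketched below, is that the training loss was logged only every 500 steps, so each evaluation row shows the most recent logged value. The 500-step interval is an assumption (it is the `Trainer` default for `logging_steps`, but the card does not state the value used), and `last_logged_step` is a hypothetical helper, not part of any library.

```python
def last_logged_step(eval_step, logging_steps=500):
    """Return the most recent logging step at or before eval_step, or None.

    logging_steps=500 is an assumed value; the model card does not report it.
    """
    completed = eval_step // logging_steps
    return completed * logging_steps if completed > 0 else None


# Evaluation steps taken from the removed table (5 epochs, 375 steps each).
eval_steps = [375, 750, 1125, 1500, 1875]

for epoch, step in enumerate(eval_steps, start=1):
    logged = last_logged_step(step)
    label = "No log" if logged is None else f"loss logged at step {logged}"
    print(f"epoch {epoch} (step {step}): {label}")
```

Under this assumption, epochs 4 and 5 both resolve to the log written at step 1500, which would account for the repeated 2.2337.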

     metrics:
     - name: Bleu
       type: bleu
+      value: 5.8913
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the wmt14 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.2040
+- Bleu: 5.8913
+- Gen Len: 17.5408
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 | Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+| No log        | 1.0   | 188  | 2.3366          | 2.8075 | 17.8188 |
+| No log        | 2.0   | 376  | 2.2557          | 4.8765 | 17.626  |
+| 2.6928        | 3.0   | 564  | 2.2246          | 5.5454 | 17.5534 |
+| 2.6928        | 4.0   | 752  | 2.2086          | 5.8511 | 17.5461 |
+| 2.6928        | 5.0   | 940  | 2.2040          | 5.8913 | 17.5408 |
 ### Framework versions
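
The change in the Step column (375 per epoch before, 188 per epoch after) follows from doubling the batch size from 8 to 16. A minimal sketch of that arithmetic is below. It assumes a training subset of about 3000 examples, which is inferred from 375 steps at batch size 8 and is not stated in the card (any size from 2993 to 3000 gives the same counts), and it assumes a single device with no gradient accumulation. `steps_per_epoch` is a hypothetical helper.

```python
import math


def steps_per_epoch(num_examples, batch_size):
    """Optimizer steps per epoch when the last partial batch is kept."""
    return math.ceil(num_examples / batch_size)


NUM_EXAMPLES = 3000  # assumption: inferred from 375 steps x batch size 8
EPOCHS = 5           # from the five rows in each results table

for batch_size in (8, 16):
    per_epoch = steps_per_epoch(NUM_EXAMPLES, batch_size)
    total = per_epoch * EPOCHS
    print(f"batch_size={batch_size}: {per_epoch} steps/epoch, {total} total")
```

With these assumptions the totals come out to 1875 and 940 steps, matching the final rows of the two tables. Because the learning rate (2e-05) and the epoch count were unchanged, the larger batch size means the model received roughly half as many optimizer updates, which may be related to the lower final BLEU reported after the change (5.8913 versus 8.2154), though the card itself gives no explanation.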