ad019el
/

m2m100_418M-finetuned-tq-to-ar-1

Text2Text Generation

Generated from Trainer

Inference Endpoints

Model card Files Files and versions Community

ad019el commited on Aug 25, 2023

Commit

1408813

•

1 Parent(s): c52f047

End of training

Files changed (2) hide show

README.md +30 -4
pytorch_model.bin +1 -1

README.md CHANGED Viewed

@@ -1,8 +1,9 @@
 ---
-license: mit
 base_model: ad019el/m2m100_418M-finetuned-tq-to-ar
 tags:
 - generated_from_trainer
 model-index:
 - name: m2m100_418M-finetuned-tq-to-ar-1
   results: []
@@ -13,7 +14,11 @@ should probably proofread and complete it, then remove this comment. -->
 # m2m100_418M-finetuned-tq-to-ar-1
-This model is a fine-tuned version of [ad019el/m2m100_418M-finetuned-tq-to-ar](https://huggingface.co/ad019el/m2m100_418M-finetuned-tq-to-ar) on an unknown dataset.
 ## Model description
@@ -40,9 +45,30 @@ The following hyperparameters were used during training:
 - lr_scheduler_type: linear
 - num_epochs: 15
 ### Framework versions
 - Transformers 4.32.0
-- Pytorch 2.0.0
-- Datasets 2.1.0
 - Tokenizers 0.13.3

 ---
 base_model: ad019el/m2m100_418M-finetuned-tq-to-ar
 tags:
 - generated_from_trainer
+metrics:
+- bleu
 model-index:
 - name: m2m100_418M-finetuned-tq-to-ar-1
   results: []
 # m2m100_418M-finetuned-tq-to-ar-1
+This model is a fine-tuned version of [ad019el/m2m100_418M-finetuned-tq-to-ar](https://huggingface.co/ad019el/m2m100_418M-finetuned-tq-to-ar) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.2002
+- Bleu: 3.6349
+- Gen Len: 35.5271
 ## Model description
 - lr_scheduler_type: linear
 - num_epochs: 15
+### Training results
+| Training Loss | Epoch | Step | Validation Loss | Bleu   | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
+| 2.7537        | 0.71  | 500  | 2.2710          | 4.2969 | 35.4312 |
+| 2.6442        | 1.42  | 1000 | 2.2373          | 4.0784 | 35.1062 |
+| 2.6329        | 2.13  | 1500 | 2.2257          | 3.8894 | 36.225  |
+| 2.564         | 2.84  | 2000 | 2.2210          | 3.5513 | 36.076  |
+| 2.5352        | 3.56  | 2500 | 2.2151          | 3.7339 | 35.0885 |
+| 2.4991        | 4.27  | 3000 | 2.2078          | 3.4662 | 36.3333 |
+| 2.4782        | 4.98  | 3500 | 2.2100          | 3.3332 | 36.4062 |
+| 2.4363        | 5.69  | 4000 | 2.2085          | 3.3587 | 36.3135 |
+| 2.4411        | 6.4   | 4500 | 2.2034          | 3.8744 | 34.5073 |
+| 2.4002        | 7.11  | 5000 | 2.2036          | 3.6693 | 36.3448 |
+| 2.3841        | 7.82  | 5500 | 2.2030          | 3.7486 | 35.076  |
+| 2.3619        | 8.53  | 6000 | 2.1970          | 3.5687 | 35.8271 |
+| 2.3627        | 9.25  | 6500 | 2.2016          | 3.5394 | 35.3583 |
+| 2.3451        | 9.96  | 7000 | 2.1996          | 3.5863 | 34.9271 |
+| 2.3323        | 10.67 | 7500 | 2.2002          | 3.6349 | 35.5271 |
 ### Framework versions
 - Transformers 4.32.0
+- Pytorch 2.0.1+cu118
+- Datasets 2.14.4
 - Tokenizers 0.13.3

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e9008e22cabac5bb7495ff53393b9dc74923f6fa5740b555a1a6c7bad595af6e
 size 1935795713

 version https://git-lfs.github.com/spec/v1
+oid sha256:a62525647870141c47324b6a79e79b87f9d2f265f3fb63f128697333c64cf9b2
 size 1935795713