ad019el commited on
Commit
1408813
1 Parent(s): c52f047

End of training

Browse files
Files changed (2) hide show
  1. README.md +30 -4
  2. pytorch_model.bin +1 -1
README.md CHANGED
@@ -1,8 +1,9 @@
1
  ---
2
- license: mit
3
  base_model: ad019el/m2m100_418M-finetuned-tq-to-ar
4
  tags:
5
  - generated_from_trainer
 
 
6
  model-index:
7
  - name: m2m100_418M-finetuned-tq-to-ar-1
8
  results: []
@@ -13,7 +14,11 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # m2m100_418M-finetuned-tq-to-ar-1
15
 
16
- This model is a fine-tuned version of [ad019el/m2m100_418M-finetuned-tq-to-ar](https://huggingface.co/ad019el/m2m100_418M-finetuned-tq-to-ar) on an unknown dataset.
 
 
 
 
17
 
18
  ## Model description
19
 
@@ -40,9 +45,30 @@ The following hyperparameters were used during training:
40
  - lr_scheduler_type: linear
41
  - num_epochs: 15
42
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
43
  ### Framework versions
44
 
45
  - Transformers 4.32.0
46
- - Pytorch 2.0.0
47
- - Datasets 2.1.0
48
  - Tokenizers 0.13.3
 
1
  ---
 
2
  base_model: ad019el/m2m100_418M-finetuned-tq-to-ar
3
  tags:
4
  - generated_from_trainer
5
+ metrics:
6
+ - bleu
7
  model-index:
8
  - name: m2m100_418M-finetuned-tq-to-ar-1
9
  results: []
 
14
 
15
  # m2m100_418M-finetuned-tq-to-ar-1
16
 
17
+ This model is a fine-tuned version of [ad019el/m2m100_418M-finetuned-tq-to-ar](https://huggingface.co/ad019el/m2m100_418M-finetuned-tq-to-ar) on the None dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 2.2002
20
+ - Bleu: 3.6349
21
+ - Gen Len: 35.5271
22
 
23
  ## Model description
24
 
 
45
  - lr_scheduler_type: linear
46
  - num_epochs: 15
47
 
48
+ ### Training results
49
+
50
+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
51
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:-------:|
52
+ | 2.7537 | 0.71 | 500 | 2.2710 | 4.2969 | 35.4312 |
53
+ | 2.6442 | 1.42 | 1000 | 2.2373 | 4.0784 | 35.1062 |
54
+ | 2.6329 | 2.13 | 1500 | 2.2257 | 3.8894 | 36.225 |
55
+ | 2.564 | 2.84 | 2000 | 2.2210 | 3.5513 | 36.076 |
56
+ | 2.5352 | 3.56 | 2500 | 2.2151 | 3.7339 | 35.0885 |
57
+ | 2.4991 | 4.27 | 3000 | 2.2078 | 3.4662 | 36.3333 |
58
+ | 2.4782 | 4.98 | 3500 | 2.2100 | 3.3332 | 36.4062 |
59
+ | 2.4363 | 5.69 | 4000 | 2.2085 | 3.3587 | 36.3135 |
60
+ | 2.4411 | 6.4 | 4500 | 2.2034 | 3.8744 | 34.5073 |
61
+ | 2.4002 | 7.11 | 5000 | 2.2036 | 3.6693 | 36.3448 |
62
+ | 2.3841 | 7.82 | 5500 | 2.2030 | 3.7486 | 35.076 |
63
+ | 2.3619 | 8.53 | 6000 | 2.1970 | 3.5687 | 35.8271 |
64
+ | 2.3627 | 9.25 | 6500 | 2.2016 | 3.5394 | 35.3583 |
65
+ | 2.3451 | 9.96 | 7000 | 2.1996 | 3.5863 | 34.9271 |
66
+ | 2.3323 | 10.67 | 7500 | 2.2002 | 3.6349 | 35.5271 |
67
+
68
+
69
  ### Framework versions
70
 
71
  - Transformers 4.32.0
72
+ - Pytorch 2.0.1+cu118
73
+ - Datasets 2.14.4
74
  - Tokenizers 0.13.3
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e9008e22cabac5bb7495ff53393b9dc74923f6fa5740b555a1a6c7bad595af6e
3
  size 1935795713
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a62525647870141c47324b6a79e79b87f9d2f265f3fb63f128697333c64cf9b2
3
  size 1935795713