yonathanstwn committed
Commit 104f1e8
1 parent: e8d150e

update model card README.md

Files changed (1): README.md (+98 -0)
README.md ADDED
@@ -0,0 +1,98 @@
+ ---
+ tags:
+ - generated_from_trainer
+ datasets:
+ - open_subtitles
+ metrics:
+ - bleu
+ model-index:
+ - name: opus-mt-en-id-open-subtitles
+   results:
+   - task:
+       name: Sequence-to-sequence Language Modeling
+       type: text2text-generation
+     dataset:
+       name: open_subtitles
+       type: open_subtitles
+       config: en-id
+       split: train
+       args: en-id
+     metrics:
+     - name: Bleu
+       type: bleu
+       value: 30.2272
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # opus-mt-en-id-open-subtitles
+
+ This model was trained from scratch on the en-id configuration of the open_subtitles dataset.
+ It achieves the following results on the evaluation set after the final epoch (epoch 25):
+ - Loss: 2.3148
+ - Bleu: 30.2272
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 0.0001
+ - train_batch_size: 32
+ - eval_batch_size: 32
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - lr_scheduler_warmup_steps: 4000
+ - num_epochs: 25
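
The `linear` scheduler with 4000 warmup steps listed above can be sketched in plain Python. This is a minimal illustration, not the training code: it assumes the learning rate rises linearly from 0 to `learning_rate` over the warmup steps and then decays linearly to 0 at the last step, and it takes the total step count (703125) from the final row of the training results. The function and constant names are hypothetical.

```python
# Sketch of a linear warmup + linear decay schedule (assumed shape, see lead-in).
BASE_LR = 1e-4         # learning_rate from the hyperparameter list
WARMUP_STEPS = 4000    # lr_scheduler_warmup_steps
TOTAL_STEPS = 703125   # last step reported in the training results (25 epochs x 28125 steps)


def linear_schedule_lr(step, base_lr=BASE_LR, warmup_steps=WARMUP_STEPS,
                       total_steps=TOTAL_STEPS):
    """Return the learning rate at a given optimizer step (hypothetical helper)."""
    if step < warmup_steps:
        # Warmup phase: scale grows linearly from 0 to 1.
        scale = step / max(1, warmup_steps)
    else:
        # Decay phase: scale shrinks linearly from 1 to 0 at total_steps.
        scale = max(0.0, (total_steps - step) / max(1, total_steps - warmup_steps))
    return base_lr * scale


if __name__ == "__main__":
    for step in (0, 2000, 4000, 28125, 703125):
        print(step, linear_schedule_lr(step))
```

Under these assumptions the peak rate of 0.0001 is reached at step 4000, well before the end of the first epoch (step 28125).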
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Bleu |
+ |:-------------:|:-----:|:------:|:---------------:|:-------:|
+ | 1.5356 | 1.0 | 28125 | 1.5619 | 31.8599 |
+ | 1.4703 | 2.0 | 56250 | 1.6047 | 31.8339 |
+ | 1.3857 | 3.0 | 84375 | 1.6281 | 32.0796 |
+ | 1.313 | 4.0 | 112500 | 1.6619 | 31.7391 |
+ | 1.2468 | 5.0 | 140625 | 1.6706 | 31.9009 |
+ | 1.1831 | 6.0 | 168750 | 1.6924 | 31.4491 |
+ | 1.1232 | 7.0 | 196875 | 1.7252 | 31.7229 |
+ | 1.0649 | 8.0 | 225000 | 1.7483 | 31.7093 |
+ | 1.0078 | 9.0 | 253125 | 1.7697 | 31.4902 |
+ | 0.9516 | 10.0 | 281250 | 1.8026 | 31.4342 |
+ | 0.8969 | 11.0 | 309375 | 1.8364 | 31.2466 |
+ | 0.8436 | 12.0 | 337500 | 1.8747 | 31.1737 |
+ | 0.7916 | 13.0 | 365625 | 1.9035 | 31.0118 |
+ | 0.7406 | 14.0 | 393750 | 1.9414 | 30.9409 |
+ | 0.6912 | 15.0 | 421875 | 1.9776 | 30.9562 |
+ | 0.6439 | 16.0 | 450000 | 2.0221 | 30.582 |
+ | 0.5983 | 17.0 | 478125 | 2.0588 | 30.4478 |
+ | 0.5544 | 18.0 | 506250 | 2.1023 | 30.4601 |
+ | 0.5126 | 19.0 | 534375 | 2.1367 | 30.4802 |
+ | 0.474 | 20.0 | 562500 | 2.1790 | 30.4211 |
+ | 0.438 | 21.0 | 590625 | 2.2131 | 30.3327 |
+ | 0.4039 | 22.0 | 618750 | 2.2484 | 30.196 |
+ | 0.3737 | 23.0 | 646875 | 2.2779 | 30.1145 |
+ | 0.3475 | 24.0 | 675000 | 2.3022 | 30.2635 |
+ | 0.326 | 25.0 | 703125 | 2.3148 | 30.2272 |
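
The metrics reported at the top of the card come from the last row of this table, which is not the best row. A short sketch, with the epoch, validation loss, and Bleu columns copied from the table, shows how one might pick the strongest epoch. Reading the rising validation loss as overfitting is an interpretation, not something the card states, and the variable names are hypothetical.

```python
# (epoch, validation_loss, bleu) copied from the training results table.
RESULTS = [
    (1, 1.5619, 31.8599), (2, 1.6047, 31.8339), (3, 1.6281, 32.0796),
    (4, 1.6619, 31.7391), (5, 1.6706, 31.9009), (6, 1.6924, 31.4491),
    (7, 1.7252, 31.7229), (8, 1.7483, 31.7093), (9, 1.7697, 31.4902),
    (10, 1.8026, 31.4342), (11, 1.8364, 31.2466), (12, 1.8747, 31.1737),
    (13, 1.9035, 31.0118), (14, 1.9414, 30.9409), (15, 1.9776, 30.9562),
    (16, 2.0221, 30.582), (17, 2.0588, 30.4478), (18, 2.1023, 30.4601),
    (19, 2.1367, 30.4802), (20, 2.1790, 30.4211), (21, 2.2131, 30.3327),
    (22, 2.2484, 30.196), (23, 2.2779, 30.1145), (24, 2.3022, 30.2635),
    (25, 2.3148, 30.2272),
]

best_bleu = max(RESULTS, key=lambda row: row[2])   # row with the highest Bleu
best_loss = min(RESULTS, key=lambda row: row[1])   # row with the lowest validation loss
final = RESULTS[-1]                                # row the card's headline metrics use

# True if validation loss increases at every epoch after the first.
loss_always_rises = all(a[1] < b[1] for a, b in zip(RESULTS, RESULTS[1:]))

if __name__ == "__main__":
    print("best Bleu epoch:", best_bleu[0], best_bleu[2])
    print("lowest validation loss epoch:", best_loss[0], best_loss[1])
    print("Bleu drop from best to final:", round(best_bleu[2] - final[2], 4))
    print("validation loss rises every epoch:", loss_always_rises)
```

On these numbers, a checkpoint from one of the first few epochs would likely be a better choice than the final one, if such checkpoints were saved.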
+
+
+ ### Framework versions
+
+ - Transformers 4.26.1
+ - PyTorch 2.0.0
+ - Datasets 2.10.1
+ - Tokenizers 0.11.0