muvazana committed on
Commit f79f446 · 1 Parent(s): fe96ff9

Update README.md

Files changed (1)
  1. README.md +9 -9
README.md CHANGED
@@ -28,7 +28,7 @@ should probably proofread and complete it, then remove this comment. -->
 
  # flan-t5-base-opus-en-id-id-en
 
- This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
  - Loss: 1.3685
  - Score: 35.0259
@@ -39,7 +39,7 @@ It achieves the following results on the evaluation set:
  - Sys Len: 7288
  - Ref Len: 7354
  - Gen Len: 10.556
- <!--- Learning Rate: 0.0004-->
 
  ## Model description
 
@@ -71,13 +71,13 @@ The following hyperparameters were used during training:
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Score | Counts | Totals | Precisions | Bp | Sys Len | Ref Len | Gen Len | Rate |
- |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-----------------------:|:------------------------:|:--------------------------------------------------------------------------------:|:------:|:-------:|:-------:|:-------:|:------:|
- | 1.6959 | 0.55 | 4000 | 1.5776 | 30.6542 | [4414, 2368, 1345, 733] | [7417, 6417, 5426, 4519] | [59.511932047997846, 36.9019791179679, 24.78805750092149, 16.220402743969906] | 1.0 | 7417 | 7354 | 10.77 | 0.0045 |
- | 1.4378 | 1.11 | 8000 | 1.4527 | 32.3772 | [4526, 2538, 1483, 834] | [7567, 6567, 5576, 4666] | [59.81234306858729, 38.647784376427595, 26.596126255380202, 17.873981997428203] | 1.0 | 7567 | 7354 | 10.885 | 0.0035 |
- | 1.3904 | 1.66 | 12000 | 1.3961 | 33.8978 | [4558, 2559, 1494, 836] | [7286, 6286, 5295, 4383] | [62.55833104584134, 40.70951320394528, 28.21529745042493, 19.073693817020306] | 0.9907 | 7286 | 7354 | 10.569 | 0.0025 |
- | 1.3035 | 2.21 | 16000 | 1.3758 | 34.9471 | [4609, 2628, 1546, 880] | [7297, 6297, 5306, 4392] | [63.16294367548308, 41.73415912339209, 29.136826234451565, 20.036429872495447] | 0.9922 | 7297 | 7354 | 10.591 | 0.0015 |
- | 1.2994 | 2.77 | 20000 | 1.3685 | 35.0259 | [4617, 2627, 1550, 883] | [7288, 6288, 5297, 4382] | [63.350713501646545, 41.777989821882954, 29.261846328110252, 20.150616157005935] | 0.991 | 7288 | 7354 | 10.556 | 0.0004 |
 
 
  ### Framework versions
 
 
  # flan-t5-base-opus-en-id-id-en
 
+ <!---This model was trained from scratch on an unknown dataset.
  It achieves the following results on the evaluation set:
  - Loss: 1.3685
  - Score: 35.0259
 
  - Sys Len: 7288
  - Ref Len: 7354
  - Gen Len: 10.556
+ Learning Rate: 0.0004-->
 
  ## Model description
 
 
 
  ### Training results
 
+ | Training Loss | Epoch | Step | Validation Loss | Score | Counts | Totals | Precisions | Bp | Sys Len | Ref Len | Gen Len |
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-----------------------:|:------------------------:|:--------------------------------------------------------------------------------:|:------:|:-------:|:-------:|:-------:|
+ | 1.6959 | 0.55 | 4000 | 1.5776 | 30.6542 | [4414, 2368, 1345, 733] | [7417, 6417, 5426, 4519] | [59.511932047997846, 36.9019791179679, 24.78805750092149, 16.220402743969906] | 1.0 | 7417 | 7354 | 10.77 |
+ | 1.4378 | 1.11 | 8000 | 1.4527 | 32.3772 | [4526, 2538, 1483, 834] | [7567, 6567, 5576, 4666] | [59.81234306858729, 38.647784376427595, 26.596126255380202, 17.873981997428203] | 1.0 | 7567 | 7354 | 10.885 |
+ | 1.3904 | 1.66 | 12000 | 1.3961 | 33.8978 | [4558, 2559, 1494, 836] | [7286, 6286, 5295, 4383] | [62.55833104584134, 40.70951320394528, 28.21529745042493, 19.073693817020306] | 0.9907 | 7286 | 7354 | 10.569 |
+ | 1.3035 | 2.21 | 16000 | 1.3758 | 34.9471 | [4609, 2628, 1546, 880] | [7297, 6297, 5306, 4392] | [63.16294367548308, 41.73415912339209, 29.136826234451565, 20.036429872495447] | 0.9922 | 7297 | 7354 | 10.591 |
+ | 1.2994 | 2.77 | 20000 | 1.3685 | 35.0259 | [4617, 2627, 1550, 883] | [7288, 6288, 5297, 4382] | [63.350713501646545, 41.777989821882954, 29.261846328110252, 20.150616157005935] | 0.991 | 7288 | 7354 | 10.556 |
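
The Score, Counts, Totals, Precisions and Bp columns in the table above look like standard corpus-level BLEU statistics (the column names match what sacreBLEU reports), although the README does not say which tool produced them. Under that assumption, the sketch below recomputes the score for a row from its counts, totals and lengths. The helper name `bleu_from_stats` is hypothetical, and the code applies no smoothing, so it assumes every n-gram count is non-zero.

```python
import math


def bleu_from_stats(counts, totals, sys_len, ref_len):
    """Recompute corpus BLEU (0-100 scale) from n-gram statistics.

    Hypothetical helper for illustration; assumes all counts are non-zero.
    """
    # Modified n-gram precisions p_n = matches / candidates, for n = 1..4.
    precisions = [c / t for c, t in zip(counts, totals)]
    # Brevity penalty: 1.0 when the system output is at least as long as
    # the reference, otherwise exp(1 - ref_len / sys_len).
    bp = 1.0 if sys_len >= ref_len else math.exp(1.0 - ref_len / sys_len)
    # Geometric mean of the precisions, computed in log space.
    geo_mean = math.exp(sum(math.log(p) for p in precisions) / len(precisions))
    return 100.0 * bp * geo_mean, bp


# Values copied from the step-20000 row of the table.
score, bp = bleu_from_stats(
    counts=[4617, 2627, 1550, 883],
    totals=[7288, 6288, 5297, 4382],
    sys_len=7288,
    ref_len=7354,
)
print(f"BLEU = {score:.4f}, BP = {bp:.4f}")
```

If the assumption holds, the printed values should land close to the reported Score of 35.0259 and Bp of 0.991. Rows where Sys Len exceeds Ref Len (steps 4000 and 8000) get a brevity penalty of exactly 1.0, which is consistent with the Bp column.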
 
 
  ### Framework versions