pakawadeep commited on
Commit
68fe352
·
1 Parent(s): d25dca4

Training in progress epoch 16

Browse files
README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.3488
19
- - Validation Loss: 0.6368
20
- - Train Rouge1: 8.2862
21
  - Train Rouge2: 0.7921
22
- - Train Rougel: 8.2390
23
- - Train Rougelsum: 8.2744
24
- - Train Gen Len: 11.8960
25
- - Epoch: 15
26
 
27
  ## Model description
28
 
@@ -64,6 +64,7 @@ The following hyperparameters were used during training:
64
  | 0.4161 | 0.6521 | 8.4689 | 1.3861 | 8.4335 | 8.4512 | 11.9307 | 13 |
65
  | 0.3812 | 0.6311 | 8.2862 | 0.7921 | 8.2390 | 8.2744 | 11.9109 | 14 |
66
  | 0.3488 | 0.6368 | 8.2862 | 0.7921 | 8.2390 | 8.2744 | 11.8960 | 15 |
 
67
 
68
 
69
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.3181
19
+ - Validation Loss: 0.6449
20
+ - Train Rouge1: 8.7812
21
  - Train Rouge2: 0.7921
22
+ - Train Rougel: 8.6987
23
+ - Train Rougelsum: 8.7930
24
+ - Train Gen Len: 11.9455
25
+ - Epoch: 16
26
 
27
  ## Model description
28
 
 
64
  | 0.4161 | 0.6521 | 8.4689 | 1.3861 | 8.4335 | 8.4512 | 11.9307 | 13 |
65
  | 0.3812 | 0.6311 | 8.2862 | 0.7921 | 8.2390 | 8.2744 | 11.9109 | 14 |
66
  | 0.3488 | 0.6368 | 8.2862 | 0.7921 | 8.2390 | 8.2744 | 11.8960 | 15 |
67
+ | 0.3181 | 0.6449 | 8.7812 | 0.7921 | 8.6987 | 8.7930 | 11.9455 | 16 |
68
 
69
 
70
  ### Framework versions
logs/train/events.out.tfevents.1719390059.41eaf3db5c10.191.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ee01559c41963eac646088f76fe12b6ed12222d38340818a1baf3f27eaca58d5
3
- size 13285160
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0cfd524bfc02b9eed3a1abf2578bec6e28bb14923a77c01d207c0ac56a09467
3
+ size 13285582
logs/validation/events.out.tfevents.1719390803.41eaf3db5c10.191.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1847c06b407ac5fc1cdf37d2a960276f1f342b2b6a50ce8903f3d9359754e42c
3
- size 2579
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a2b45465b473c21ebe7e8d82fe76e88f5a60d85e7e3d08b8a9f114752054cbbd
3
+ size 2736
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e6291b9b1fda428a0b126e03526e28f8eaea87d64b8828847b9407f1580b5738
3
  size 6968370776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a019d4a269d6ad6dccd67abe7d025fde7771859abd977b7f4225563f06b6e1b2
3
  size 6968370776