pakawadeep commited on
Commit
735db61
·
1 Parent(s): 564e823

Training in progress epoch 5

Browse files
README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 1.0395
19
- - Validation Loss: 0.9417
20
- - Train Rouge1: 7.4257
21
  - Train Rouge2: 1.8812
22
- - Train Rougel: 7.4257
23
- - Train Rougelsum: 7.4022
24
- - Train Gen Len: 11.9703
25
- - Epoch: 4
26
 
27
  ## Model description
28
 
@@ -53,6 +53,7 @@ The following hyperparameters were used during training:
53
  | 1.5929 | 1.2365 | 6.2235 | 1.0891 | 6.2235 | 6.2235 | 11.6089 | 2 |
54
  | 1.3718 | 1.0833 | 7.7086 | 1.5842 | 7.4965 | 7.4965 | 11.9406 | 3 |
55
  | 1.0395 | 0.9417 | 7.4257 | 1.8812 | 7.4257 | 7.4022 | 11.9703 | 4 |
 
56
 
57
 
58
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.8993
19
+ - Validation Loss: 0.8573
20
+ - Train Rouge1: 8.5337
21
  - Train Rouge2: 1.8812
22
+ - Train Rougel: 8.4394
23
+ - Train Rougelsum: 8.4158
24
+ - Train Gen Len: 11.9059
25
+ - Epoch: 5
26
 
27
  ## Model description
28
 
 
53
  | 1.5929 | 1.2365 | 6.2235 | 1.0891 | 6.2235 | 6.2235 | 11.6089 | 2 |
54
  | 1.3718 | 1.0833 | 7.7086 | 1.5842 | 7.4965 | 7.4965 | 11.9406 | 3 |
55
  | 1.0395 | 0.9417 | 7.4257 | 1.8812 | 7.4257 | 7.4022 | 11.9703 | 4 |
56
+ | 0.8993 | 0.8573 | 8.5337 | 1.8812 | 8.4394 | 8.4158 | 11.9059 | 5 |
57
 
58
 
59
  ### Framework versions
logs/train/events.out.tfevents.1720755221.0655e1851386.4174.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:eea2784b4f78c0591b5354f2584cc032df16fa82c5919d68ac9ee3c76f0fa0af
3
- size 13280518
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ddefc088bf1d68bfb0c625e5cb833037f87e0a954314326a4176f5defde52641
3
+ size 13280940
logs/validation/events.out.tfevents.1720756090.0655e1851386.4174.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:2c5c829eb1f701d639ad4b7a526adeb4a3787447a69ccd3075f56924343c38fc
3
- size 856
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:36d31d06ccb4dcbef2a2f703a4a4f18cd9aeed94c40f0c2942dbade1a278067e
3
+ size 1012
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1c080c836a699806a762c2b72f98d5ef04b011cccab1c13b6d4ee8240d39b464
3
  size 6968370776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:30123b0b74375038faec6b97afb0b9be3792589947917dab600e228a1749251d
3
  size 6968370776