rhaymison commited on
Commit
918e05d
1 Parent(s): 7ab2bb3

End of training

Browse files
Files changed (2) hide show
  1. README.md +24 -24
  2. model.safetensors +1 -1
README.md CHANGED
@@ -1,8 +1,8 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
- base_model: google/flan-t5-small
6
  metrics:
7
  - rouge
8
  model-index:
@@ -17,11 +17,11 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 1.7474
21
- - Rouge1: 15.6258
22
- - Rouge2: 5.8684
23
- - Rougel: 13.5135
24
- - Rougelsum: 14.5266
25
  - Gen Len: 19.0
26
 
27
  ## Model description
@@ -55,24 +55,24 @@ The following hyperparameters were used during training:
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
58
- | 2.3424 | 0.27 | 500 | 2.0519 | 13.8547 | 4.8819 | 12.0331 | 12.8514 | 19.0 |
59
- | 2.1616 | 0.53 | 1000 | 1.9535 | 14.7848 | 5.382 | 12.8365 | 13.6475 | 19.0 |
60
- | 2.0723 | 0.8 | 1500 | 1.9142 | 14.6906 | 5.434 | 12.8341 | 13.6491 | 19.0 |
61
- | 2.0202 | 1.07 | 2000 | 1.8883 | 14.8456 | 5.5148 | 12.7977 | 13.7626 | 19.0 |
62
- | 1.9921 | 1.33 | 2500 | 1.8473 | 14.8381 | 5.555 | 12.791 | 13.6959 | 19.0 |
63
- | 1.9539 | 1.6 | 3000 | 1.8293 | 15.2161 | 5.7276 | 13.1915 | 14.1315 | 19.0 |
64
- | 1.9455 | 1.87 | 3500 | 1.8166 | 15.2705 | 5.6751 | 13.2908 | 14.2064 | 19.0 |
65
- | 1.9266 | 2.13 | 4000 | 1.8018 | 15.303 | 5.7225 | 13.2318 | 14.1942 | 19.0 |
66
- | 1.8949 | 2.4 | 4500 | 1.7904 | 15.7181 | 6.0653 | 13.6993 | 14.5572 | 19.0 |
67
- | 1.906 | 2.67 | 5000 | 1.7814 | 15.7143 | 5.9897 | 13.6178 | 14.5986 | 19.0 |
68
- | 1.8737 | 2.93 | 5500 | 1.7706 | 15.4469 | 5.8011 | 13.3005 | 14.3128 | 19.0 |
69
- | 1.8779 | 3.2 | 6000 | 1.7668 | 15.6243 | 5.9534 | 13.5025 | 14.5397 | 19.0 |
70
- | 1.8638 | 3.47 | 6500 | 1.7629 | 15.3433 | 5.6495 | 13.251 | 14.3 | 19.0 |
71
- | 1.8644 | 3.73 | 7000 | 1.7559 | 15.4275 | 5.6924 | 13.2484 | 14.3135 | 19.0 |
72
- | 1.8389 | 4.0 | 7500 | 1.7522 | 15.5374 | 5.8713 | 13.4588 | 14.4702 | 19.0 |
73
- | 1.8467 | 4.27 | 8000 | 1.7507 | 15.47 | 5.7876 | 13.3985 | 14.4401 | 19.0 |
74
- | 1.8287 | 4.53 | 8500 | 1.7502 | 15.4761 | 5.7342 | 13.3502 | 14.4118 | 19.0 |
75
- | 1.8439 | 4.8 | 9000 | 1.7474 | 15.6258 | 5.8684 | 13.5135 | 14.5266 | 19.0 |
76
 
77
 
78
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: google/flan-t5-small
4
  tags:
5
  - generated_from_trainer
 
6
  metrics:
7
  - rouge
8
  model-index:
 
17
 
18
  This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 1.6541
21
+ - Rouge1: 16.3352
22
+ - Rouge2: 6.2366
23
+ - Rougel: 14.1335
24
+ - Rougelsum: 15.2755
25
  - Gen Len: 19.0
26
 
27
  ## Model description
 
55
 
56
  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
57
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
58
+ | 1.847 | 0.27 | 500 | 1.7443 | 15.4969 | 5.9408 | 13.5074 | 14.5518 | 19.0 |
59
+ | 1.8333 | 0.53 | 1000 | 1.7194 | 15.6496 | 5.8641 | 13.5584 | 14.669 | 19.0 |
60
+ | 1.8043 | 0.8 | 1500 | 1.7209 | 15.8523 | 6.0544 | 13.7563 | 14.8941 | 19.0 |
61
+ | 1.7903 | 1.07 | 2000 | 1.7156 | 15.8969 | 6.0071 | 13.7534 | 14.8513 | 19.0 |
62
+ | 1.7862 | 1.33 | 2500 | 1.7007 | 15.8441 | 5.958 | 13.66 | 14.7226 | 19.0 |
63
+ | 1.7687 | 1.6 | 3000 | 1.6949 | 15.9134 | 6.0486 | 13.9238 | 14.9171 | 19.0 |
64
+ | 1.7724 | 1.87 | 3500 | 1.6909 | 15.8827 | 5.8941 | 13.7195 | 14.8736 | 19.0 |
65
+ | 1.7653 | 2.13 | 4000 | 1.6811 | 16.0819 | 5.9791 | 13.8639 | 15.0031 | 19.0 |
66
+ | 1.7392 | 2.4 | 4500 | 1.6761 | 15.706 | 5.7384 | 13.5978 | 14.7374 | 19.0 |
67
+ | 1.7578 | 2.67 | 5000 | 1.6729 | 15.8926 | 5.9629 | 13.767 | 14.9088 | 19.0 |
68
+ | 1.7353 | 2.93 | 5500 | 1.6675 | 16.0266 | 5.9024 | 13.8471 | 14.9721 | 19.0 |
69
+ | 1.7425 | 3.2 | 6000 | 1.6626 | 16.0732 | 6.1141 | 13.9016 | 15.0673 | 19.0 |
70
+ | 1.73 | 3.47 | 6500 | 1.6631 | 16.1333 | 6.0951 | 13.9551 | 15.0686 | 19.0 |
71
+ | 1.7355 | 3.73 | 7000 | 1.6616 | 16.1704 | 6.1575 | 14.0481 | 15.079 | 19.0 |
72
+ | 1.7139 | 4.0 | 7500 | 1.6572 | 16.2592 | 6.25 | 14.0403 | 15.1851 | 19.0 |
73
+ | 1.7188 | 4.27 | 8000 | 1.6580 | 16.1572 | 6.0661 | 14.0029 | 15.0935 | 19.0 |
74
+ | 1.7045 | 4.53 | 8500 | 1.6560 | 16.1409 | 6.1478 | 13.9806 | 15.0795 | 19.0 |
75
+ | 1.7201 | 4.8 | 9000 | 1.6541 | 16.3352 | 6.2366 | 14.1335 | 15.2755 | 19.0 |
76
 
77
 
78
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:3894438bc3ce24b46387f75bb873f459030acea3c02d00c6a90190ad71a48c32
3
  size 307867048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09d7a2f0b9997cd291b3a026ef3a0f7db6e982aa93d1bc4e86c1c78a8d2bdc03
3
  size 307867048