VladimML committed on
Commit 1879b5b
1 Parent(s): 172c298

Training complete

README.md CHANGED
@@ -18,11 +18,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 2.4331
- - Rouge1: 6.6074
- - Rouge2: 0.8
- - Rougel: 6.5487
- - Rougelsum: 6.6233
+ - Loss: 2.3419
+ - Rouge1: 6.9313
+ - Rouge2: 1.9587
+ - Rougel: 6.8503
+ - Rougelsum: 6.9385
 
  ## Model description
 
@@ -51,21 +51,21 @@ The following hyperparameters were used during training:
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
- |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|
- | 5.7796 | 1.0 | 625 | 2.7282 | 4.0839 | 0.8785 | 4.0421 | 4.0758 |
- | 3.4978 | 2.0 | 1250 | 2.6171 | 6.6481 | 0.9067 | 6.6002 | 6.6437 |
- | 3.2419 | 3.0 | 1875 | 2.5208 | 6.6661 | 0.64 | 6.5897 | 6.6246 |
- | 3.1063 | 4.0 | 2500 | 2.4918 | 7.2246 | 1.0467 | 7.1758 | 7.1947 |
- | 3.0177 | 5.0 | 3125 | 2.4535 | 6.8523 | 1.0 | 6.8009 | 6.8923 |
- | 2.9537 | 6.0 | 3750 | 2.4452 | 6.459 | 0.8667 | 6.4208 | 6.481 |
- | 2.9156 | 7.0 | 4375 | 2.4373 | 6.5019 | 0.8 | 6.4421 | 6.5094 |
- | 2.8914 | 8.0 | 5000 | 2.4331 | 6.6074 | 0.8 | 6.5487 | 6.6233 |
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
+ |:-------------:|:-----:|:-----:|:---------------:|:------:|:------:|:------:|:---------:|
+ | 4.4281 | 1.0 | 1250 | 2.5899 | 7.0481 | 2.0747 | 6.9849 | 7.0179 |
+ | 3.2368 | 2.0 | 2500 | 2.4568 | 6.7532 | 1.7462 | 6.6934 | 6.7462 |
+ | 3.0526 | 3.0 | 3750 | 2.4315 | 6.6106 | 1.9088 | 6.5307 | 6.5784 |
+ | 2.9412 | 4.0 | 5000 | 2.3882 | 7.0644 | 1.9283 | 6.9687 | 7.0399 |
+ | 2.8711 | 5.0 | 6250 | 2.3700 | 7.2808 | 1.9358 | 7.2006 | 7.2603 |
+ | 2.8193 | 6.0 | 7500 | 2.3604 | 7.0911 | 1.9737 | 6.9918 | 7.0491 |
+ | 2.7866 | 7.0 | 8750 | 2.3479 | 6.9948 | 2.0044 | 6.8824 | 6.9737 |
+ | 2.7699 | 8.0 | 10000 | 2.3419 | 6.9313 | 1.9587 | 6.8503 | 6.9385 |
 
 
  ### Framework versions
 
- - Transformers 4.37.2
+ - Transformers 4.38.2
  - Pytorch 2.1.0+cu121
- - Datasets 2.17.1
+ - Datasets 2.18.0
  - Tokenizers 0.15.2
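Note on the updated README results: the headline evaluation numbers (Loss 2.3419, Rouge1 6.9313, Rouge2 1.9587) match the epoch 8 row of the new training table, so the card appears to report the final epoch rather than the best epoch per metric. The sketch below is only a quick sanity check over values copied from that table; the names `ROWS` and `best_epoch` are hypothetical helpers and not part of the training script.

```python
# Rows copied from the updated "Training results" table:
# (epoch, validation_loss, rouge1, rouge2)
ROWS = [
    (1, 2.5899, 7.0481, 2.0747),
    (2, 2.4568, 6.7532, 1.7462),
    (3, 2.4315, 6.6106, 1.9088),
    (4, 2.3882, 7.0644, 1.9283),
    (5, 2.3700, 7.2808, 1.9358),
    (6, 2.3604, 7.0911, 1.9737),
    (7, 2.3479, 6.9948, 2.0044),
    (8, 2.3419, 6.9313, 1.9587),
]


def best_epoch(rows, column, lower_is_better=False):
    """Return the epoch number whose value in `column` is best."""
    pick = min if lower_is_better else max
    return pick(rows, key=lambda row: row[column])[0]


best_loss_epoch = best_epoch(ROWS, 1, lower_is_better=True)
best_rouge1_epoch = best_epoch(ROWS, 2)
best_rouge2_epoch = best_epoch(ROWS, 3)

print("lowest validation loss at epoch", best_loss_epoch)
print("highest Rouge1 at epoch", best_rouge1_epoch)
print("highest Rouge2 at epoch", best_rouge2_epoch)
```

Validation loss keeps falling through the last epoch, while the ROUGE columns peak earlier, so which checkpoint counts as "best" depends on the metric chosen for model selection.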
generation_config.json CHANGED
@@ -1,7 +1,6 @@
  {
- "_from_model_config": true,
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
- "transformers_version": "4.37.2"
+ "transformers_version": "4.38.2"
  }
runs/Mar13_11-29-40_a5db1099f1ab/events.out.tfevents.1710329390.a5db1099f1ab.979.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:08262543d3795db4b262684dbbe48b7088f950dad5f90e29aca17c02036eb347
- size 9846
+ oid sha256:697c050f7ef9aaf27ae6ae121bbb4cffaba4e4533529d9bf017ac10a5f847dce
+ size 10674
runs/Mar13_11-29-40_a5db1099f1ab/events.out.tfevents.1710332280.a5db1099f1ab.979.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:9eb84129b8f75f84e222eb1b730cad08e6396de0584a0058fccc7cd73223f89d
+ size 562
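Note on the event-file entries: the three-line bodies above are Git LFS pointer files, not the TensorBoard logs themselves. Each pointer records the spec version, the SHA-256 object id of the real file, and its size in bytes. A minimal sketch of reading one, assuming the simple `key value` line layout shown in this diff; `parse_lfs_pointer` is a hypothetical helper, not part of any library.

```python
def parse_lfs_pointer(text):
    """Parse the text of a Git LFS pointer file into a small dict."""
    fields = {}
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line is "<key> <value>"; split on the first space only.
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the newly added events file in this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:9eb84129b8f75f84e222eb1b730cad08e6396de0584a0058fccc7cd73223f89d
size 562
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["hash_algo"], pointer["size"])
```

The `size` field is the byte size of the real log file, so the first events file grew from 9846 to 10674 bytes in this commit and the newly added one is 562 bytes.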