santyzenith committed on
Commit
7d13e88
1 Parent(s): 8b8f86c

End of training

Files changed (1)
  1. README.md +18 -12
README.md CHANGED
@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [flax-community/spanish-t5-small](https://huggingface.co/flax-community/spanish-t5-small) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 1.3716
- - Rouge1: 44.6732
- - Rouge2: 29.5427
- - Rougel: 44.3407
- - Rougelsum: 44.3626
- - Gen Len: 8.2906

  ## Model description

@@ -41,22 +41,28 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 5e-05
  - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 4

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
- | 2.1475 | 1.0 | 527 | 1.7342 | 37.3065 | 21.7821 | 36.9349 | 36.853 | 8.2799 |
- | 1.6495 | 2.0 | 1054 | 1.5130 | 42.3167 | 26.6311 | 41.9494 | 41.8617 | 8.1496 |
- | 1.445 | 3.0 | 1581 | 1.4049 | 43.5614 | 28.5147 | 43.2416 | 43.2528 | 8.2585 |
- | 1.2756 | 4.0 | 2108 | 1.3716 | 44.6732 | 29.5427 | 44.3407 | 44.3626 | 8.2906 |


  ### Framework versions
 

  This model is a fine-tuned version of [flax-community/spanish-t5-small](https://huggingface.co/flax-community/spanish-t5-small) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.4052
+ - Rouge1: 42.0934
+ - Rouge2: 28.3804
+ - Rougel: 41.2489
+ - Rougelsum: 41.3148
+ - Gen Len: 8.4979

  ## Model description

 
  ### Training hyperparameters

  The following hyperparameters were used during training:
+ - learning_rate: 2e-05
  - train_batch_size: 8
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
+ - num_epochs: 10
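
To make the `lr_scheduler_type: linear` setting concrete, here is a minimal sketch of a linear decay schedule using the new values from this commit (learning rate 2e-05, 10 epochs) and the 527 optimizer steps per epoch reported in the Step column of the results table. The helper `linear_lr` is hypothetical and written only for illustration. It assumes zero warmup steps, since the card lists none, and it mirrors the general shape of a linear schedule rather than the trainer's exact implementation.

```python
import math

# Values taken from the model card in this commit.
BASE_LR = 2e-05
NUM_EPOCHS = 10
STEPS_PER_EPOCH = 527  # Step column advances by 527 per epoch
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS

# Assumption: no warmup, because the card does not list warmup steps.
WARMUP_STEPS = 0


def linear_lr(step: int,
              base_lr: float = BASE_LR,
              total_steps: int = TOTAL_STEPS,
              warmup_steps: int = WARMUP_STEPS) -> float:
    """Hypothetical helper: linear warmup (if any), then linear decay to zero."""
    if step < warmup_steps:
        # Ramp up linearly from 0 to base_lr during warmup.
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr to 0 over the remaining steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Learning rate at the start, at the midpoint, and at the final step.
    for step in (0, TOTAL_STEPS // 2, TOTAL_STEPS):
        print(f"step {step:>5}: lr = {linear_lr(step):.2e}")
```

Under these assumptions the rate halves by the end of epoch 5 and reaches zero at step 5270, the last step in the results table.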

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+ | 2.3625 | 1.0 | 527 | 2.0249 | 34.7282 | 19.5441 | 33.9362 | 33.9053 | 7.6303 |
+ | 1.9453 | 2.0 | 1054 | 1.7992 | 37.5587 | 22.5669 | 36.6904 | 36.7767 | 8.2671 |
+ | 1.7429 | 3.0 | 1581 | 1.6655 | 39.7178 | 24.1301 | 38.8293 | 38.8615 | 8.3462 |
+ | 1.6306 | 4.0 | 2108 | 1.5791 | 40.8591 | 26.0948 | 40.088 | 40.0812 | 7.9573 |
+ | 1.461 | 5.0 | 2635 | 1.5204 | 40.6998 | 25.5825 | 39.8972 | 39.9642 | 8.3568 |
+ | 1.4178 | 6.0 | 3162 | 1.4702 | 41.0675 | 26.4585 | 40.1879 | 40.2333 | 8.4679 |
+ | 1.3394 | 7.0 | 3689 | 1.4452 | 41.9649 | 27.1925 | 41.0945 | 41.1233 | 8.2329 |
+ | 1.2844 | 8.0 | 4216 | 1.4210 | 41.9633 | 27.7102 | 40.9804 | 41.0706 | 8.5406 |
+ | 1.2151 | 9.0 | 4743 | 1.4072 | 41.9336 | 28.0917 | 41.0597 | 41.1328 | 8.4722 |
+ | 1.215 | 10.0 | 5270 | 1.4052 | 42.0934 | 28.3804 | 41.2489 | 41.3148 | 8.4979 |
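
The final evaluation metrics of the two runs in this diff can be compared directly. The sketch below copies the removed values (learning rate 5e-05, 4 epochs) and the added values (learning rate 2e-05, 10 epochs) from the card and prints the per-metric change. The dictionary names `previous` and `current` are illustrative only and are not part of the repository.

```python
# Final evaluation metrics copied from the removed ("-") and added ("+") lines.
previous = {  # learning_rate 5e-05, 4 epochs
    "Loss": 1.3716,
    "Rouge1": 44.6732,
    "Rouge2": 29.5427,
    "Rougel": 44.3407,
    "Rougelsum": 44.3626,
    "Gen Len": 8.2906,
}
current = {  # learning_rate 2e-05, 10 epochs
    "Loss": 1.4052,
    "Rouge1": 42.0934,
    "Rouge2": 28.3804,
    "Rougel": 41.2489,
    "Rougelsum": 41.3148,
    "Gen Len": 8.4979,
}

# Change per metric, rounded to the four decimals the card reports.
deltas = {name: round(current[name] - previous[name], 4) for name in previous}

if __name__ == "__main__":
    for name, delta in deltas.items():
        print(f"{name:>10}: {previous[name]:8.4f} -> {current[name]:8.4f} ({delta:+.4f})")
```

On these numbers the longer run at the lower learning rate ends with a slightly higher validation loss and lower scores on all four ROUGE variants than the run it replaces.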


  ### Framework versions