File size: 3,719 Bytes
80bb9e9
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
---
license: apache-2.0
tags:
- generated_from_trainer
metrics:
- rouge
model-index:
- name: t5-small-mse-summarization
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# t5-small-mse-summarization

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
It achieves the following results on the evaluation set:
- Loss: 1.2293
- Rouge1: 40.0683
- Rouge2: 20.2468
- Rougel: 34.0606
- Rougelsum: 38.0836
- Bleurt: -0.8806
- Gen Len: 18.649

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 1e-05
- train_batch_size: 64
- eval_batch_size: 256
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 20

### Training results

| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Bleurt  | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:-------:|
| 1.768         | 1.0   | 267  | 1.4680          | 36.028  | 16.6997 | 30.4417 | 33.8528   | -0.9554 | 18.557  |
| 1.5588        | 2.0   | 534  | 1.3877          | 37.4937 | 18.2652 | 32.1414 | 35.621    | -0.9248 | 18.646  |
| 1.503         | 3.0   | 801  | 1.3469          | 38.1407 | 18.7353 | 32.5747 | 36.3185   | -0.9069 | 18.649  |
| 1.4721        | 4.0   | 1068 | 1.3226          | 38.1918 | 18.5221 | 32.4574 | 36.3975   | -0.9071 | 18.661  |
| 1.4402        | 5.0   | 1335 | 1.3061          | 38.672  | 18.8355 | 32.734  | 36.7534   | -0.9074 | 18.696  |
| 1.4141        | 6.0   | 1602 | 1.2909          | 38.9248 | 19.0159 | 33.0053 | 36.98     | -0.9066 | 18.677  |
| 1.4034        | 7.0   | 1869 | 1.2779          | 39.3301 | 19.2995 | 33.2336 | 37.3958   | -0.9047 | 18.68   |
| 1.3864        | 8.0   | 2136 | 1.2686          | 39.5046 | 19.5836 | 33.4436 | 37.46     | -0.8928 | 18.681  |
| 1.3801        | 9.0   | 2403 | 1.2599          | 39.6226 | 19.6625 | 33.6596 | 37.6379   | -0.8954 | 18.686  |
| 1.3714        | 10.0  | 2670 | 1.2555          | 39.4381 | 19.5523 | 33.4644 | 37.4258   | -0.8983 | 18.721  |
| 1.3586        | 11.0  | 2937 | 1.2493          | 39.6582 | 19.7031 | 33.5629 | 37.5895   | -0.8951 | 18.707  |
| 1.3482        | 12.0  | 3204 | 1.2436          | 39.6473 | 19.6636 | 33.631  | 37.643    | -0.8945 | 18.7    |
| 1.3448        | 13.0  | 3471 | 1.2407          | 39.6741 | 19.686  | 33.6859 | 37.6884   | -0.8922 | 18.661  |
| 1.3458        | 14.0  | 3738 | 1.2382          | 39.7934 | 19.879  | 33.8368 | 37.8078   | -0.8863 | 18.658  |
| 1.3315        | 15.0  | 4005 | 1.2343          | 39.812  | 19.935  | 33.8546 | 37.8262   | -0.8859 | 18.666  |
| 1.3374        | 16.0  | 4272 | 1.2335          | 39.7989 | 19.9576 | 33.8681 | 37.803    | -0.885  | 18.657  |
| 1.3301        | 17.0  | 4539 | 1.2315          | 39.9386 | 20.0602 | 33.941  | 37.9452   | -0.8853 | 18.656  |
| 1.3295        | 18.0  | 4806 | 1.2303          | 40.0492 | 20.1841 | 34.0707 | 38.0749   | -0.8823 | 18.651  |
| 1.3284        | 19.0  | 5073 | 1.2294          | 40.0335 | 20.2042 | 34.061  | 38.0575   | -0.881  | 18.649  |
| 1.3249        | 20.0  | 5340 | 1.2293          | 40.0683 | 20.2468 | 34.0606 | 38.0836   | -0.8806 | 18.649  |


### Framework versions

- Transformers 4.21.2
- Pytorch 1.12.1+cu113
- Datasets 2.4.0
- Tokenizers 0.12.1