File size: 4,834 Bytes
c290ce7
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
---
license: apache-2.0
base_model: t5-small
tags:
- generated_from_trainer
metrics:
- bleu
model-index:
- name: genz_model2
  results: []
---

<!-- This model card has been generated automatically according to the information the Trainer had access to. You
should probably proofread and complete it, then remove this comment. -->

# genz_model2

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
It achieves the following results on the evaluation set:
- Loss: 1.1282
- Bleu: 40.1672
- Gen Len: 15.25

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- learning_rate: 2e-05
- train_batch_size: 16
- eval_batch_size: 16
- seed: 42
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
- lr_scheduler_type: linear
- num_epochs: 50

### Training results

| Training Loss | Epoch | Step | Validation Loss | Bleu    | Gen Len |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
| No log        | 1.0   | 107  | 1.9410          | 28.2848 | 15.4509 |
| No log        | 2.0   | 214  | 1.7415          | 32.3881 | 15.3645 |
| No log        | 3.0   | 321  | 1.6506          | 32.8796 | 15.5374 |
| No log        | 4.0   | 428  | 1.5856          | 33.1982 | 15.5748 |
| 1.9676        | 5.0   | 535  | 1.5352          | 34.3335 | 15.4556 |
| 1.9676        | 6.0   | 642  | 1.4929          | 34.962  | 15.5187 |
| 1.9676        | 7.0   | 749  | 1.4595          | 35.459  | 15.535  |
| 1.9676        | 8.0   | 856  | 1.4316          | 35.6253 | 15.5421 |
| 1.9676        | 9.0   | 963  | 1.4066          | 35.9011 | 15.4953 |
| 1.5695        | 10.0  | 1070 | 1.3838          | 36.5102 | 15.4907 |
| 1.5695        | 11.0  | 1177 | 1.3608          | 36.2464 | 15.5631 |
| 1.5695        | 12.0  | 1284 | 1.3410          | 36.3368 | 15.5748 |
| 1.5695        | 13.0  | 1391 | 1.3238          | 37.2607 | 15.493  |
| 1.5695        | 14.0  | 1498 | 1.3092          | 36.9306 | 15.5234 |
| 1.4322        | 15.0  | 1605 | 1.2943          | 37.2516 | 15.5701 |
| 1.4322        | 16.0  | 1712 | 1.2812          | 37.9106 | 15.4696 |
| 1.4322        | 17.0  | 1819 | 1.2694          | 38.0468 | 15.4907 |
| 1.4322        | 18.0  | 1926 | 1.2559          | 38.0982 | 15.4836 |
| 1.3384        | 19.0  | 2033 | 1.2455          | 38.5418 | 15.4556 |
| 1.3384        | 20.0  | 2140 | 1.2375          | 38.2567 | 15.4463 |
| 1.3384        | 21.0  | 2247 | 1.2285          | 38.3496 | 15.3972 |
| 1.3384        | 22.0  | 2354 | 1.2182          | 38.6696 | 15.4393 |
| 1.3384        | 23.0  | 2461 | 1.2092          | 38.6524 | 15.4182 |
| 1.2646        | 24.0  | 2568 | 1.2013          | 38.5694 | 15.4346 |
| 1.2646        | 25.0  | 2675 | 1.1947          | 38.8347 | 15.4065 |
| 1.2646        | 26.0  | 2782 | 1.1893          | 38.7466 | 15.3738 |
| 1.2646        | 27.0  | 2889 | 1.1840          | 38.8294 | 15.3855 |
| 1.2646        | 28.0  | 2996 | 1.1795          | 38.8043 | 15.3738 |
| 1.2144        | 29.0  | 3103 | 1.1722          | 38.9285 | 15.3995 |
| 1.2144        | 30.0  | 3210 | 1.1691          | 39.1174 | 15.3435 |
| 1.2144        | 31.0  | 3317 | 1.1646          | 39.2841 | 15.3341 |
| 1.2144        | 32.0  | 3424 | 1.1612          | 39.1613 | 15.2687 |
| 1.1741        | 33.0  | 3531 | 1.1581          | 39.2741 | 15.2921 |
| 1.1741        | 34.0  | 3638 | 1.1528          | 39.3863 | 15.3014 |
| 1.1741        | 35.0  | 3745 | 1.1501          | 39.5385 | 15.264  |
| 1.1741        | 36.0  | 3852 | 1.1465          | 39.7548 | 15.2897 |
| 1.1741        | 37.0  | 3959 | 1.1448          | 39.8433 | 15.25   |
| 1.1518        | 38.0  | 4066 | 1.1415          | 39.8777 | 15.2243 |
| 1.1518        | 39.0  | 4173 | 1.1398          | 40.0676 | 15.2453 |
| 1.1518        | 40.0  | 4280 | 1.1384          | 40.0178 | 15.2033 |
| 1.1518        | 41.0  | 4387 | 1.1348          | 39.8617 | 15.278  |
| 1.1518        | 42.0  | 4494 | 1.1336          | 39.9387 | 15.2664 |
| 1.1216        | 43.0  | 4601 | 1.1322          | 40.1468 | 15.257  |
| 1.1216        | 44.0  | 4708 | 1.1314          | 40.0534 | 15.257  |
| 1.1216        | 45.0  | 4815 | 1.1305          | 40.1604 | 15.257  |
| 1.1216        | 46.0  | 4922 | 1.1297          | 40.1344 | 15.2523 |
| 1.112         | 47.0  | 5029 | 1.1290          | 40.1921 | 15.2617 |
| 1.112         | 48.0  | 5136 | 1.1285          | 40.2545 | 15.25   |
| 1.112         | 49.0  | 5243 | 1.1283          | 40.1672 | 15.25   |
| 1.112         | 50.0  | 5350 | 1.1282          | 40.1672 | 15.25   |


### Framework versions

- Transformers 4.31.0
- Pytorch 2.0.1+cu118
- Datasets 2.14.3
- Tokenizers 0.13.3