File size: 3,401 Bytes
1f83327
 
 
 
 
 
 
 
 
 
 
 
 
 
1e6680f
1f83327
701b49f
 
 
 
 
 
1b2511e
701b49f
1f83327
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1b2511e
794255d
9f3c4f8
7761350
7bb29e5
1fdcb33
c997ca9
b8747ed
01f06e2
74c1d8b
8547e73
3eca7b9
06048fe
701b49f
1f83327
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
---
license: apache-2.0
tags:
- generated_from_keras_callback
model-index:
- name: Jammal7/t5-small-finetuned-Big-Patents
  results: []
---

<!-- This model card has been generated automatically according to the information Keras had access to. You should
probably proofread and complete it, then remove this comment. -->

# Jammal7/t5-small-finetuned-Big-Patents

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on [big_patents](https://huggingface.co/datasets/big_patent) datasets.
It achieves the following results on the evaluation set:
- Train Loss: 2.1148
- Validation Loss: 2.0812
- Train Rouge1: 23.0058
- Train Rouge2: 12.5142
- Train Rougel: 19.9076
- Train Rougelsum: 20.8720
- Train Gen Len: 19.0
- Epoch: 14

## Model description

More information needed

## Intended uses & limitations

More information needed

## Training and evaluation data

More information needed

## Training procedure

### Training hyperparameters

The following hyperparameters were used during training:
- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': 2e-05, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-07, 'amsgrad': False, 'weight_decay_rate': 0.01}
- training_precision: float32

### Training results

| Train Loss | Validation Loss | Train Rouge1 | Train Rouge2 | Train Rougel | Train Rougelsum | Train Gen Len | Epoch |
|:----------:|:---------------:|:------------:|:------------:|:------------:|:---------------:|:-------------:|:-----:|
| 2.6552     | 2.3220          | 21.3700      | 10.2105      | 17.5338      | 18.8802         | 18.9929       | 0     |
| 2.4395     | 2.2535          | 22.2253      | 11.2822      | 18.7238      | 19.9181         | 19.0          | 1     |
| 2.3755     | 2.2174          | 22.2327      | 11.5575      | 18.8348      | 19.9379         | 19.0          | 2     |
| 2.3329     | 2.1886          | 22.5328      | 11.8225      | 19.2582      | 20.3227         | 19.0          | 3     |
| 2.3004     | 2.1681          | 22.7872      | 12.1642      | 19.5194      | 20.5907         | 19.0          | 4     |
| 2.2696     | 2.1579          | 22.6969      | 12.0848      | 19.5012      | 20.5564         | 19.0          | 5     |
| 2.2487     | 2.1385          | 22.7300      | 12.2347      | 19.5685      | 20.5822         | 19.0          | 6     |
| 2.2249     | 2.1347          | 22.7250      | 12.2077      | 19.5103      | 20.5246         | 19.0          | 7     |
| 2.2060     | 2.1265          | 22.8921      | 12.4139      | 19.6903      | 20.7126         | 19.0          | 8     |
| 2.1892     | 2.1126          | 22.9440      | 12.4268      | 19.7717      | 20.8156         | 19.0          | 9     |
| 2.1740     | 2.1084          | 22.9201      | 12.4491      | 19.8107      | 20.8059         | 19.0          | 10    |
| 2.1580     | 2.0980          | 22.9947      | 12.3798      | 19.7494      | 20.7852         | 19.0          | 11    |
| 2.1414     | 2.0934          | 23.0885      | 12.5899      | 19.9531      | 20.9145         | 19.0          | 12    |
| 2.1302     | 2.0809          | 23.0854      | 12.5580      | 19.9468      | 20.8737         | 19.0          | 13    |
| 2.1148     | 2.0812          | 23.0058      | 12.5142      | 19.9076      | 20.8720         | 19.0          | 14    |


### Framework versions

- Transformers 4.30.1
- TensorFlow 2.12.0
- Datasets 2.12.0
- Tokenizers 0.13.3