dead-owwl commited on
Commit
7af89cc
1 Parent(s): 342c035

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +97 -0
README.md ADDED
@@ -0,0 +1,97 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - generated_from_trainer
5
+ datasets:
6
+ - billsum
7
+ metrics:
8
+ - rouge
9
+ model-index:
10
+ - name: custom_billsum_model
11
+ results:
12
+ - task:
13
+ name: Sequence-to-sequence Language Modeling
14
+ type: text2text-generation
15
+ dataset:
16
+ name: billsum
17
+ type: billsum
18
+ config: default
19
+ split: ca_test
20
+ args: default
21
+ metrics:
22
+ - name: Rouge1
23
+ type: rouge
24
+ value: 0.1968
25
+ ---
26
+
27
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
28
+ should probably proofread and complete it, then remove this comment. -->
29
+
30
+ # custom_billsum_model
31
+
32
+ This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the billsum dataset.
33
+ It achieves the following results on the evaluation set:
34
+ - Loss: 2.2150
35
+ - Rouge1: 0.1968
36
+ - Rouge2: 0.0981
37
+ - Rougel: 0.1672
38
+ - Rougelsum: 0.167
39
+ - Gen Len: 19.0
40
+
41
+ ## Model description
42
+
43
+ More information needed
44
+
45
+ ## Intended uses & limitations
46
+
47
+ More information needed
48
+
49
+ ## Training and evaluation data
50
+
51
+ More information needed
52
+
53
+ ## Training procedure
54
+
55
+ ### Training hyperparameters
56
+
57
+ The following hyperparameters were used during training:
58
+ - learning_rate: 2e-05
59
+ - train_batch_size: 16
60
+ - eval_batch_size: 16
61
+ - seed: 42
62
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
63
+ - lr_scheduler_type: linear
64
+ - num_epochs: 20
65
+
66
+ ### Training results
67
+
68
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
69
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
70
+ | No log | 1.0 | 62 | 2.4551 | 0.1626 | 0.0663 | 0.135 | 0.135 | 19.0 |
71
+ | No log | 2.0 | 124 | 2.3987 | 0.1882 | 0.0866 | 0.1577 | 0.1577 | 19.0 |
72
+ | No log | 3.0 | 186 | 2.3639 | 0.1964 | 0.0937 | 0.1652 | 0.165 | 19.0 |
73
+ | No log | 4.0 | 248 | 2.3370 | 0.1943 | 0.0931 | 0.164 | 0.1638 | 19.0 |
74
+ | No log | 5.0 | 310 | 2.3135 | 0.1942 | 0.0938 | 0.1646 | 0.1643 | 19.0 |
75
+ | No log | 6.0 | 372 | 2.2949 | 0.195 | 0.0938 | 0.1648 | 0.1648 | 19.0 |
76
+ | No log | 7.0 | 434 | 2.2809 | 0.1937 | 0.0944 | 0.1643 | 0.1642 | 19.0 |
77
+ | No log | 8.0 | 496 | 2.2676 | 0.1949 | 0.0957 | 0.1664 | 0.166 | 19.0 |
78
+ | 2.5047 | 9.0 | 558 | 2.2582 | 0.1954 | 0.097 | 0.1665 | 0.1662 | 19.0 |
79
+ | 2.5047 | 10.0 | 620 | 2.2510 | 0.1951 | 0.0966 | 0.1661 | 0.166 | 19.0 |
80
+ | 2.5047 | 11.0 | 682 | 2.2416 | 0.1962 | 0.0979 | 0.1673 | 0.1671 | 19.0 |
81
+ | 2.5047 | 12.0 | 744 | 2.2360 | 0.196 | 0.0975 | 0.1664 | 0.1663 | 19.0 |
82
+ | 2.5047 | 13.0 | 806 | 2.2302 | 0.1965 | 0.098 | 0.1667 | 0.1666 | 19.0 |
83
+ | 2.5047 | 14.0 | 868 | 2.2262 | 0.1973 | 0.0985 | 0.1673 | 0.1671 | 19.0 |
84
+ | 2.5047 | 15.0 | 930 | 2.2224 | 0.197 | 0.0976 | 0.1668 | 0.1667 | 19.0 |
85
+ | 2.5047 | 16.0 | 992 | 2.2200 | 0.1973 | 0.0984 | 0.1673 | 0.1671 | 19.0 |
86
+ | 2.3391 | 17.0 | 1054 | 2.2183 | 0.1967 | 0.0974 | 0.1669 | 0.1666 | 19.0 |
87
+ | 2.3391 | 18.0 | 1116 | 2.2164 | 0.1968 | 0.0974 | 0.1669 | 0.1666 | 19.0 |
88
+ | 2.3391 | 19.0 | 1178 | 2.2152 | 0.1969 | 0.0982 | 0.1673 | 0.1671 | 19.0 |
89
+ | 2.3391 | 20.0 | 1240 | 2.2150 | 0.1968 | 0.0981 | 0.1672 | 0.167 | 19.0 |
90
+
91
+
92
+ ### Framework versions
93
+
94
+ - Transformers 4.30.2
95
+ - Pytorch 2.0.0
96
+ - Datasets 2.1.0
97
+ - Tokenizers 0.13.3