veronica-girolimetti committed on
Commit
c1c3406
1 Parent(s): 56836c9

End of training

README.md ADDED
@@ -0,0 +1,67 @@
+ ---
+ license: apache-2.0
+ base_model: google/flan-t5-small
+ tags:
+ - generated_from_trainer
+ metrics:
+ - rouge
+ model-index:
+ - name: test-dialogue-summarization
+   results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # test-dialogue-summarization
+
+ This model is a fine-tuned version of [google/flan-t5-small](https://huggingface.co/google/flan-t5-small) on an unknown dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 2.3848
+ - Rouge: {'rouge1': 45.6704, 'rouge2': 21.245, 'rougeL': 20.2411, 'rougeLsum': 20.2411}
+ - Bert Score: 0.8718
+ - Bleurt 20: -0.8946
+ - Gen Len: 14.99
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 7
+ - eval_batch_size: 7
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 5
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Rouge | Bert Score | Bleurt 20 | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:-------------------------------------------------------------------------------:|:----------:|:---------:|:-------:|
+ | 2.2971 | 1.0 | 186 | 2.4618 | {'rouge1': 46.8614, 'rouge2': 21.2026, 'rougeL': 20.3766, 'rougeLsum': 20.3766} | 0.8719 | -0.8804 | 15.38 |
+ | 2.5384 | 2.0 | 372 | 2.4062 | {'rouge1': 46.4924, 'rouge2': 21.6223, 'rougeL': 20.304, 'rougeLsum': 20.304} | 0.8724 | -0.8819 | 15.145 |
+ | 2.5224 | 3.0 | 558 | 2.3936 | {'rouge1': 46.0775, 'rouge2': 21.2642, 'rougeL': 20.3148, 'rougeLsum': 20.3148} | 0.871 | -0.8905 | 15.345 |
+ | 2.4661 | 4.0 | 744 | 2.3867 | {'rouge1': 45.6718, 'rouge2': 21.122, 'rougeL': 20.2877, 'rougeLsum': 20.2877} | 0.8719 | -0.8887 | 14.975 |
+ | 2.4426 | 5.0 | 930 | 2.3848 | {'rouge1': 45.6704, 'rouge2': 21.245, 'rougeL': 20.2411, 'rougeLsum': 20.2411} | 0.8718 | -0.8946 | 14.99 |
+
+
+ ### Framework versions
+
+ - Transformers 4.35.2
+ - Pytorch 2.1.0+cu121
+ - Datasets 2.16.1
+ - Tokenizers 0.15.0
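The README added in this commit lists `learning_rate: 2e-05` with `lr_scheduler_type: linear`, and its results table shows 930 optimizer steps over 5 epochs (186 per epoch). To make those numbers concrete, here is a minimal sketch of the learning rate such a schedule would produce at each epoch boundary. It assumes zero warmup steps and decay to exactly zero at the final step; the model card states neither. The function name `linear_lr` is hypothetical and is not part of any library.

```python
def linear_lr(step: int,
              base_lr: float = 2e-05,
              total_steps: int = 930,
              warmup_steps: int = 0) -> float:
    """Learning rate after `step` optimizer steps.

    Linear warmup from 0 to base_lr (skipped when warmup_steps == 0),
    then linear decay from base_lr to 0 at total_steps.
    ASSUMPTION: warmup_steps=0 is a guess; the model card does not report it.
    """
    if step < warmup_steps:
        return base_lr * step / max(1, warmup_steps)
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


# Step counts taken from the "Step" column of the results table.
num_epochs = 5
total_steps = 930
steps_per_epoch = total_steps // num_epochs

for epoch in range(1, num_epochs + 1):
    step = epoch * steps_per_epoch
    print(f"epoch {epoch}: step {step}, lr {linear_lr(step):.2e}")
```

Under the further assumptions of a single device, no gradient accumulation, and a final partial batch being kept, 186 steps per epoch at a batch size of 7 would correspond to between 1,296 and 1,302 training examples.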
generation_config.json ADDED
@@ -0,0 +1,6 @@
+ {
+   "decoder_start_token_id": 0,
+   "eos_token_id": 1,
+   "pad_token_id": 0,
+   "transformers_version": "4.35.2"
+ }
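The generation config above is small enough to inspect by hand. As an illustrative sketch, the snippet below parses the same JSON with the standard library and checks that decoding starts from the pad token, which is the usual T5 convention. The constant name `GENERATION_CONFIG_TEXT` is hypothetical; the values are copied from the diff.

```python
import json

# Content of generation_config.json as added in this commit.
GENERATION_CONFIG_TEXT = """
{
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
  "transformers_version": "4.35.2"
}
"""

config = json.loads(GENERATION_CONFIG_TEXT)

# T5-style models feed the pad token as the first decoder input,
# so the two ids are expected to match.
starts_with_pad = config["decoder_start_token_id"] == config["pad_token_id"]

print("decoder starts with pad token:", starts_with_pad)
print("generation stops at token id:", config["eos_token_id"])
```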
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fd99ea99e8dbcc686d75143c7754c654f3afc71ba567ca4064110bf35cfd2ea3
+ oid sha256:3b61732238e1636cfcc09b7518f9e3d85b99f63fb9386355ee21a29daa25b188
  size 307867048
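Only the `oid` changes for `model.safetensors`; the size stays at 307867048 bytes, which is consistent with the same architecture holding new weight values. A Git LFS pointer is a short text file of `key value` lines, so a downloaded file can be checked against it. The sketch below shows one way to do that with the standard library; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local file path is left to the reader.

```python
import hashlib


def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file into version, hash algorithm, digest, and size."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    hasher = hashlib.new(pointer["algo"])
    size = 0
    with open(path, "rb") as handle:
        # Read in 1 MiB chunks so large weight files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            hasher.update(chunk)
            size += len(chunk)
    return size == pointer["size"] and hasher.hexdigest() == pointer["digest"]


# New pointer for model.safetensors, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:3b61732238e1636cfcc09b7518f9e3d85b99f63fb9386355ee21a29daa25b188
size 307867048
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["digest"][:12], pointer["size"])
```

Calling `matches_pointer("model.safetensors", pointer)` on a local copy of the weights would then confirm whether that copy corresponds to this commit.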
runs/Jan05_20-55-51_3ab4673dd019/events.out.tfevents.1704488152.3ab4673dd019.445.6 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:feaf1b819b9b5b435213035048d2e0f9d20c23240a9812fa92d75f4185cdf1a2
- size 7071
+ oid sha256:9f23cdd05badcb5a52bd82dcb272dbb1472f2a29b30b37cf00568a3792e6a18f
+ size 9654
runs/Jan05_20-55-51_3ab4673dd019/events.out.tfevents.1704488826.3ab4673dd019.445.7 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:fa23b2cf3d622d25dbd813d0eb6e9f2bfb555b95482d0baa8dd7da421cc134ab
+ size 517