wgcv commited on
Commit
66a2d86
1 Parent(s): 6a8fb20

End of training

Browse files
README.md ADDED
@@ -0,0 +1,64 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model: google/pegasus-xsum
3
+ tags:
4
+ - generated_from_trainer
5
+ metrics:
6
+ - rouge
7
+ model-index:
8
+ - name: tidy-tab-model-pegasus-xsum
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # tidy-tab-model-pegasus-xsum
16
+
17
+ This model is a fine-tuned version of [google/pegasus-xsum](https://huggingface.co/google/pegasus-xsum) on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 0.9644
20
+ - Rouge1: 0.7456
21
+ - Rouge2: 0.6153
22
+ - Rougel: 0.7401
23
+ - Rougelsum: 0.7422
24
+ - Gen Len: 5.2607
25
+
26
+ ## Model description
27
+
28
+ More information needed
29
+
30
+ ## Intended uses & limitations
31
+
32
+ More information needed
33
+
34
+ ## Training and evaluation data
35
+
36
+ More information needed
37
+
38
+ ## Training procedure
39
+
40
+ ### Training hyperparameters
41
+
42
+ The following hyperparameters were used during training:
43
+ - learning_rate: 3e-05
44
+ - train_batch_size: 16
45
+ - eval_batch_size: 16
46
+ - seed: 42
47
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
+ - lr_scheduler_type: linear
49
+ - num_epochs: 8
50
+
51
+ ### Training results
52
+
53
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
54
+ |:-------------:|:------:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
55
+ | 1.5893 | 3.7879 | 500 | 1.0234 | 0.7302 | 0.594 | 0.7229 | 0.7244 | 5.3034 |
56
+ | 0.9308 | 7.5758 | 1000 | 0.9644 | 0.7456 | 0.6153 | 0.7401 | 0.7422 | 5.2607 |
57
+
58
+
59
+ ### Framework versions
60
+
61
+ - Transformers 4.41.2
62
+ - Pytorch 2.3.0+cu121
63
+ - Datasets 2.20.0
64
+ - Tokenizers 0.19.1
generation_config.json ADDED
@@ -0,0 +1,11 @@
 
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "bos_token_id": 0,
3
+ "decoder_start_token_id": 0,
4
+ "eos_token_id": 1,
5
+ "forced_eos_token_id": 1,
6
+ "length_penalty": 0.6,
7
+ "max_length": 64,
8
+ "num_beams": 8,
9
+ "pad_token_id": 0,
10
+ "transformers_version": "4.41.2"
11
+ }
runs/Jul09_15-57-38_c5eadc05cc54/events.out.tfevents.1720540660.c5eadc05cc54.3138.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1a2df304835658832d1f2e6d860aaaba6d7d498bca4a2f5b6ca6d883cfb8647c
3
- size 7263
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c2181d44a4ce899867541b09aad79995ef00327c16b25f0d70a220c5e3354c46
3
+ size 7617