InfinityC committed on
Commit
348a2bd
1 Parent(s): 25c97b7

End of training

README.md ADDED
@@ -0,0 +1,68 @@
+ ---
+ license: apache-2.0
+ base_model: google-t5/t5-small
+ tags:
+ - generated_from_trainer
+ metrics:
+ - rouge
+ model-index:
+ - name: test_sum_abs_t5_small_wasa_stops
+ results: []
+ ---
+
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->
+
+ # test_sum_abs_t5_small_wasa_stops
+
+ This model is a fine-tuned version of [google-t5/t5-small](https://huggingface.co/google-t5/t5-small) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.8601
+ - Rouge1: 0.3823
+ - Rouge2: 0.2702
+ - Rougel: 0.3451
+ - Rougelsum: 0.3454
+ - Gen Len: 18.9864
+
+ ## Model description
+
+ More information needed
+
+ ## Intended uses & limitations
+
+ More information needed
+
+ ## Training and evaluation data
+
+ More information needed
+
+ ## Training procedure
+
+ ### Training hyperparameters
+
+ The following hyperparameters were used during training:
+ - learning_rate: 2e-05
+ - train_batch_size: 16
+ - eval_batch_size: 16
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 4
+ - mixed_precision_training: Native AMP
+
+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
+ | 1.0591 | 1.0 | 1764 | 0.9275 | 0.3767 | 0.2652 | 0.3403 | 0.3404 | 18.9787 |
+ | 0.9758 | 2.0 | 3528 | 0.8813 | 0.3817 | 0.2702 | 0.3448 | 0.345 | 18.9819 |
+ | 0.9575 | 3.0 | 5292 | 0.8648 | 0.3818 | 0.2692 | 0.3445 | 0.3446 | 18.987 |
+ | 0.9435 | 4.0 | 7056 | 0.8601 | 0.3823 | 0.2702 | 0.3451 | 0.3454 | 18.9864 |
+
+
+ ### Framework versions
+
+ - Transformers 4.39.3
+ - Pytorch 2.1.2
+ - Datasets 2.18.0
+ - Tokenizers 0.15.2
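
Reviewer note: the hyperparameters in the added README imply a learning rate that decays linearly from 2e-05 to zero over 7056 optimizer steps (1764 steps per epoch, 4 epochs). The sketch below reproduces that schedule in plain Python. It assumes zero warmup steps, since the card lists none; `linear_lr` is a hypothetical helper written for illustration, not a Transformers function.

```python
# Values copied from the model card in this commit.
BASE_LR = 2e-05
STEPS_PER_EPOCH = 1764
NUM_EPOCHS = 4
TOTAL_STEPS = STEPS_PER_EPOCH * NUM_EPOCHS  # 7056, the step count in the last table row

# Assumption: the card does not mention warmup, so none is modelled here.
WARMUP_STEPS = 0


def linear_lr(step: int) -> float:
    """Learning rate after `step` optimizer steps with linear warmup then linear decay."""
    if step < WARMUP_STEPS:
        return BASE_LR * step / max(1, WARMUP_STEPS)
    remaining = max(0, TOTAL_STEPS - step)
    return BASE_LR * remaining / max(1, TOTAL_STEPS - WARMUP_STEPS)


if __name__ == "__main__":
    # Print the learning rate at each epoch boundary.
    for epoch in range(NUM_EPOCHS + 1):
        step = epoch * STEPS_PER_EPOCH
        print(f"epoch {epoch}: step {step}, lr {linear_lr(step):.2e}")
```

If training ran on a single device without gradient accumulation (an assumption; the card does not say), 1764 steps of 16 examples per epoch would put the training set at no more than 28,224 examples.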
generation_config.json ADDED
@@ -0,0 +1,6 @@
+ {
+   "decoder_start_token_id": 0,
+   "eos_token_id": 1,
+   "pad_token_id": 0,
+   "transformers_version": "4.39.3"
+ }
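
Reviewer note: T5 starts decoding from the pad token, which is why `decoder_start_token_id` and `pad_token_id` are both 0 in the added file. The sketch below parses the same JSON with the standard `json` module and checks that relationship. The inline string mirrors the diff above, and `check_generation_config` is a hypothetical helper, not part of Transformers.

```python
import json

# Contents of generation_config.json as added in this commit.
GENERATION_CONFIG = """
{
  "decoder_start_token_id": 0,
  "eos_token_id": 1,
  "pad_token_id": 0,
  "transformers_version": "4.39.3"
}
"""


def check_generation_config(text: str) -> dict:
    """Parse the config and verify the token-id fields are integers."""
    cfg = json.loads(text)
    for key in ("decoder_start_token_id", "eos_token_id", "pad_token_id"):
        if not isinstance(cfg.get(key), int):
            raise ValueError(f"{key} is missing or not an integer")
    return cfg


if __name__ == "__main__":
    cfg = check_generation_config(GENERATION_CONFIG)
    # For T5, the decoder is primed with the pad token.
    print(cfg["decoder_start_token_id"] == cfg["pad_token_id"])
```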
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8ea6e1e6213a910e494712eff6149c37e23fe563e245d5580e9141d4553247f1
+ oid sha256:7f47024ed69387561ca9c5de0b09d657f56ab98a783b79f4c0d9260f1dd8a369
  size 242041896
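
Reviewer note: this change only swaps the Git LFS pointer. The object hash differs while the size stays at 242041896 bytes, which is consistent with retrained weights for an unchanged architecture. The sketch below parses the two pointers and compares them; `parse_lfs_pointer` is a hypothetical helper, not part of the Git LFS tooling.

```python
LFS_SPEC = "https://git-lfs.github.com/spec/v1"

# Old and new pointer contents for model.safetensors, copied from the diff.
OLD_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:8ea6e1e6213a910e494712eff6149c37e23fe563e245d5580e9141d4553247f1
size 242041896
"""
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:7f47024ed69387561ca9c5de0b09d657f56ab98a783b79f4c0d9260f1dd8a369
size 242041896
"""


def parse_lfs_pointer(text: str) -> dict:
    """Split a pointer file into hash algorithm, hex digest, and byte size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    if fields.get("version") != LFS_SPEC:
        raise ValueError("not a Git LFS v1 pointer")
    algo, _, digest = fields["oid"].partition(":")
    return {"algo": algo, "digest": digest, "size": int(fields["size"])}


if __name__ == "__main__":
    old, new = parse_lfs_pointer(OLD_POINTER), parse_lfs_pointer(NEW_POINTER)
    print("hash changed:", old["digest"] != new["digest"])
    print("size changed:", old["size"] != new["size"])
```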
runs/May09_01-33-23_17460881c74e/events.out.tfevents.1715218405.17460881c74e.34.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:cceb52a606f4828605bf1bbcfba3bbba63555d5d8c81f15c5365c16dfdc65166
- size 10176
+ oid sha256:9d63ae5229bf43a51f45bdfb77c416d5d4c7d923a5855a953c310cd2f87693f8
+ size 11055