ToirovSadi commited on
Commit
98cb999
1 Parent(s): fe599f4

End of training

Browse files
README.md ADDED
@@ -0,0 +1,70 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - generated_from_trainer
4
+ metrics:
5
+ - rouge
6
+ - bleu
7
+ model-index:
8
+ - name: t5-small
9
+ results: []
10
+ ---
11
+
12
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
+ should probably proofread and complete it, then remove this comment. -->
14
+
15
+ # t5-small
16
+
17
+ This model was trained from scratch on an unknown dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 1.6644
20
+ - Rouge1: 58.4345
21
+ - Rouge2: 35.6298
22
+ - Rougel: 56.7347
23
+ - Rougelsum: 56.768
24
+ - Bleu: 0.2405
25
+
26
+ ## Model description
27
+
28
+ More information needed
29
+
30
+ ## Intended uses & limitations
31
+
32
+ More information needed
33
+
34
+ ## Training and evaluation data
35
+
36
+ More information needed
37
+
38
+ ## Training procedure
39
+
40
+ ### Training hyperparameters
41
+
42
+ The following hyperparameters were used during training:
43
+ - learning_rate: 5e-05
44
+ - train_batch_size: 64
45
+ - eval_batch_size: 64
46
+ - seed: 42
47
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
+ - lr_scheduler_type: linear
49
+ - num_epochs: 3
50
+
51
+ ### Training results
52
+
53
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Bleu |
54
+ |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:------:|
55
+ | 1.8479 | 0.34 | 3000 | 1.7017 | 58.311 | 35.3602 | 56.618 | 56.6461 | 0.2379 |
56
+ | 1.8327 | 0.68 | 6000 | 1.6922 | 58.3222 | 35.4329 | 56.6269 | 56.6666 | 0.2385 |
57
+ | 1.8236 | 1.01 | 9000 | 1.6832 | 58.3368 | 35.42 | 56.6223 | 56.6611 | 0.2382 |
58
+ | 1.8046 | 1.35 | 12000 | 1.6784 | 58.4461 | 35.538 | 56.7423 | 56.779 | 0.2394 |
59
+ | 1.8078 | 1.69 | 15000 | 1.6718 | 58.4257 | 35.566 | 56.7209 | 56.7558 | 0.2394 |
60
+ | 1.7989 | 2.03 | 18000 | 1.6688 | 58.4162 | 35.6118 | 56.7173 | 56.7626 | 0.2402 |
61
+ | 1.7945 | 2.37 | 21000 | 1.6661 | 58.444 | 35.6166 | 56.7463 | 56.7845 | 0.2401 |
62
+ | 1.7907 | 2.71 | 24000 | 1.6644 | 58.4345 | 35.6298 | 56.7347 | 56.768 | 0.2405 |
63
+
64
+
65
+ ### Framework versions
66
+
67
+ - Transformers 4.36.2
68
+ - Pytorch 1.13.0+cu117
69
+ - Datasets 2.4.0
70
+ - Tokenizers 0.15.0
generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "decoder_start_token_id": 0,
3
+ "eos_token_id": 1,
4
+ "pad_token_id": 0,
5
+ "transformers_version": "4.36.2"
6
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:78b30933df736e89141f98a0a7b33e79bbc0925a1eb748e1f21bdc8fc88a2973
3
  size 242041896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cd288c1dedb2a163f4c9f954192b4c920d33bfe716c3b04720ce21416234810
3
  size 242041896
runs/Dec29_15-50-45_n1y9t1aejz/events.out.tfevents.1703865068.n1y9t1aejz.383.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f0b2f53e515799ec2f8807ed8e8f17f37ff995aa496ec749b10a41badd717826
3
- size 17896
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ec92774e5ba14d5a963f91bfa1f6b145504c3ba33f99077c66fd6bc94ca44476
3
+ size 18256