Model save
Browse files
README.md
CHANGED
@@ -1,8 +1,10 @@
|
|
1 |
---
|
2 |
license: apache-2.0
|
|
|
3 |
tags:
|
4 |
- generated_from_trainer
|
5 |
-
|
|
|
6 |
model-index:
|
7 |
- name: Vit-GPT2-COCO2017Flickr-80k-08
|
8 |
results: []
|
@@ -13,19 +15,14 @@ should probably proofread and complete it, then remove this comment. -->
|
|
13 |
|
14 |
# Vit-GPT2-COCO2017Flickr-80k-08
|
15 |
|
16 |
-
This model is a fine-tuned version of [NourFakih/Vit-GPT2-COCO2017Flickr-
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
-
|
19 |
-
-
|
20 |
-
-
|
21 |
-
-
|
22 |
-
-
|
23 |
-
-
|
24 |
-
- eval_runtime: 459.5392
|
25 |
-
- eval_samples_per_second: 8.704
|
26 |
-
- eval_steps_per_second: 2.176
|
27 |
-
- epoch: 0.1
|
28 |
-
- step: 500
|
29 |
|
30 |
## Model description
|
31 |
|
@@ -54,9 +51,45 @@ The following hyperparameters were used during training:
|
|
54 |
- lr_scheduler_type: linear
|
55 |
- num_epochs: 3.0
|
56 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
57 |
### Framework versions
|
58 |
|
59 |
-
- Transformers 4.
|
60 |
- Pytorch 2.1.2
|
61 |
-
- Datasets 2.
|
62 |
-
- Tokenizers 0.
|
|
|
1 |
---
|
2 |
license: apache-2.0
|
3 |
+
base_model: NourFakih/Vit-GPT2-COCO2017Flickr-80k-08
|
4 |
tags:
|
5 |
- generated_from_trainer
|
6 |
+
metrics:
|
7 |
+
- rouge
|
8 |
model-index:
|
9 |
- name: Vit-GPT2-COCO2017Flickr-80k-08
|
10 |
results: []
|
|
|
15 |
|
16 |
# Vit-GPT2-COCO2017Flickr-80k-08
|
17 |
|
18 |
+
This model is a fine-tuned version of [NourFakih/Vit-GPT2-COCO2017Flickr-80k-08](https://huggingface.co/NourFakih/Vit-GPT2-COCO2017Flickr-80k-08) on an unknown dataset.
|
19 |
It achieves the following results on the evaluation set:
|
20 |
+
- Gen Len: 12.0243
|
21 |
+
- Loss: 0.5354
|
22 |
+
- Rouge1: 40.114
|
23 |
+
- Rouge2: 14.6699
|
24 |
+
- Rougel: 36.1001
|
25 |
+
- Rougelsum: 36.1128
|
|
|
|
|
|
|
|
|
|
|
26 |
|
27 |
## Model description
|
28 |
|
|
|
51 |
- lr_scheduler_type: linear
|
52 |
- num_epochs: 3.0
|
53 |
|
54 |
+
### Training results
|
55 |
+
|
56 |
+
| Training Loss | Epoch | Step | Gen Len | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum |
|
57 |
+
|:-------------:|:-----:|:-----:|:-------:|:---------------:|:-------:|:-------:|:-------:|:---------:|
|
58 |
+
| 0.3691 | 0.1 | 500 | 11.7758 | 0.4730 | 39.8086 | 14.7674 | 36.1546 | 36.1739 |
|
59 |
+
| 0.3706 | 0.2 | 1000 | 11.5977 | 0.4739 | 39.8972 | 14.9064 | 36.1193 | 36.138 |
|
60 |
+
| 0.3709 | 0.3 | 1500 | 11.7103 | 0.4759 | 39.9874 | 14.8528 | 36.3155 | 36.3317 |
|
61 |
+
| 0.3721 | 0.4 | 2000 | 12.175 | 0.4678 | 39.7192 | 14.5844 | 35.8447 | 35.8728 |
|
62 |
+
| 0.3655 | 0.5 | 2500 | 11.9002 | 0.4684 | 40.3132 | 15.1157 | 36.5749 | 36.5823 |
|
63 |
+
| 0.3623 | 0.6 | 3000 | 12.025 | 0.4672 | 40.1643 | 14.978 | 36.3002 | 36.3232 |
|
64 |
+
| 0.3676 | 0.7 | 3500 | 11.815 | 0.4623 | 40.5036 | 15.3751 | 36.8369 | 36.867 |
|
65 |
+
| 0.3613 | 0.8 | 4000 | 12.054 | 0.4647 | 40.4078 | 15.3105 | 36.65 | 36.6732 |
|
66 |
+
| 0.3539 | 0.9 | 4500 | 11.904 | 0.4634 | 40.3794 | 15.233 | 36.7155 | 36.7435 |
|
67 |
+
| 0.3481 | 1.0 | 5000 | 11.738 | 0.4644 | 40.037 | 14.8477 | 36.3648 | 36.3903 |
|
68 |
+
| 0.2889 | 1.1 | 5500 | 11.55 | 0.4897 | 40.1394 | 14.7595 | 36.4428 | 36.4696 |
|
69 |
+
| 0.2908 | 1.2 | 6000 | 11.9823 | 0.4865 | 40.0479 | 14.8181 | 36.316 | 36.3519 |
|
70 |
+
| 0.2882 | 1.3 | 6500 | 11.7945 | 0.4863 | 40.5912 | 15.3128 | 36.7638 | 36.7755 |
|
71 |
+
| 0.2901 | 1.4 | 7000 | 11.87 | 0.4868 | 40.3138 | 14.9695 | 36.5032 | 36.5211 |
|
72 |
+
| 0.2857 | 1.5 | 7500 | 11.776 | 0.4834 | 40.2242 | 14.9881 | 36.5381 | 36.5607 |
|
73 |
+
| 0.279 | 1.6 | 8000 | 12.0132 | 0.4999 | 40.2751 | 15.0173 | 36.4172 | 36.4257 |
|
74 |
+
| 0.281 | 1.7 | 8500 | 11.7685 | 0.4951 | 40.1172 | 14.8119 | 36.2966 | 36.296 |
|
75 |
+
| 0.2831 | 1.8 | 9000 | 12.2293 | 0.4979 | 39.9913 | 14.7427 | 36.1539 | 36.1517 |
|
76 |
+
| 0.2799 | 1.9 | 9500 | 11.8718 | 0.4911 | 40.5123 | 15.09 | 36.7528 | 36.7622 |
|
77 |
+
| 0.2778 | 2.0 | 10000 | 12.0262 | 0.4929 | 40.5005 | 15.1027 | 36.6202 | 36.6327 |
|
78 |
+
| 0.2318 | 2.1 | 10500 | 12.133 | 0.5237 | 40.1565 | 14.8022 | 36.1946 | 36.2074 |
|
79 |
+
| 0.2279 | 2.2 | 11000 | 11.92 | 0.5278 | 40.5801 | 15.0843 | 36.7832 | 36.8021 |
|
80 |
+
| 0.2272 | 2.3 | 11500 | 11.8057 | 0.5284 | 40.2332 | 14.8728 | 36.4401 | 36.4343 |
|
81 |
+
| 0.2308 | 2.4 | 12000 | 11.9518 | 0.5263 | 39.9961 | 14.6475 | 36.035 | 36.0528 |
|
82 |
+
| 0.2262 | 2.5 | 12500 | 11.9347 | 0.5322 | 40.3373 | 14.9137 | 36.3692 | 36.3718 |
|
83 |
+
| 0.2233 | 2.6 | 13000 | 11.9147 | 0.5329 | 40.1924 | 14.776 | 36.1644 | 36.1593 |
|
84 |
+
| 0.223 | 2.7 | 13500 | 11.9927 | 0.5370 | 40.3211 | 14.9563 | 36.3211 | 36.3345 |
|
85 |
+
| 0.2241 | 2.8 | 14000 | 11.9367 | 0.5365 | 40.0897 | 14.6372 | 36.1484 | 36.1606 |
|
86 |
+
| 0.2257 | 2.9 | 14500 | 12.0407 | 0.5332 | 40.2316 | 14.741 | 36.1795 | 36.1866 |
|
87 |
+
| 0.2201 | 3.0 | 15000 | 12.0243 | 0.5354 | 40.114 | 14.6699 | 36.1001 | 36.1128 |
|
88 |
+
|
89 |
+
|
90 |
### Framework versions
|
91 |
|
92 |
+
- Transformers 4.41.2
|
93 |
- Pytorch 2.1.2
|
94 |
+
- Datasets 2.19.2
|
95 |
+
- Tokenizers 0.19.1
|
runs/Jun09_12-04-01_99b35e8a5856/events.out.tfevents.1717934643.99b35e8a5856.34.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:977bb3e5940fac9b6d05ea26c86380d10d6720c5b911117b73941b9f994dbce3
|
3 |
+
size 9634
|
runs/Jun09_12-04-14_99b35e8a5856/events.out.tfevents.1717934656.99b35e8a5856.34.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f54e9e33a1e9a69dbb1648ff47e938fc40b045991e79635356011a2fd58fd0f0
|
3 |
+
size 9634
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5304
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d9cc8cdfce8093a08341f6f93f59d6c55bd464c1922a7ad81d7f9c35c5698b3c
|
3 |
size 5304
|