pasindu/vit-swin-base-224-gpt2-image-captioning

vision-encoder-decoder

Generated from Trainer

Inference Endpoints

pasindu committed on Jan 7

Commit 7651e76 • 1 Parent(s): 51ae78b

End of training

Files changed (3)

README.md +15 -9
model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED

@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1928
-- Rouge1: 41.8594
-- Rouge2: 28.7256
-- Rougel: 39.7537
-- Rougelsum: 39.7849
-- Bleu: 26.9506
+- Loss: 0.0001
+- Rouge1: 99.2148
+- Rouge2: 99.1824
+- Rougel: 99.22
+- Rougelsum: 99.2169
+- Bleu: 96.4656
 - Gen Len: 10.4161
 ## Model description
@@ -42,15 +42,21 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 16
-- eval_batch_size: 16
+- train_batch_size: 8
+- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
+- num_epochs: 50
 ### Training results
+| Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Bleu    | Gen Len |
+|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:-------:|
+| 0.622         | 11.36 | 2000 | 0.0330          | 91.0769 | 88.8333 | 90.7025 | 90.7277   | 84.8472 | 10.4161 |
+| 0.0547        | 22.73 | 4000 | 0.0015          | 99.0694 | 98.9636 | 99.0615 | 99.0613   | 96.1312 | 10.4161 |
+| 0.0238        | 34.09 | 6000 | 0.0007          | 99.1681 | 99.0942 | 99.167  | 99.1646   | 96.3754 | 10.4161 |
+| 0.0046        | 45.45 | 8000 | 0.0001          | 99.2225 | 99.1781 | 99.217  | 99.2171   | 96.4412 | 10.4161 |
 ### Framework versions
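
The Epoch and Step columns of the training results table imply how many optimizer steps make up one epoch, which in turn bounds the size of the training set. The sketch below works through that arithmetic. It assumes a single device and no gradient accumulation (the card states neither), and the function names are illustrative, not part of any library.

```python
# (epoch, step) pairs copied from the training results table.
LOGGED = [(11.36, 2000), (22.73, 4000), (34.09, 6000), (45.45, 8000)]

TRAIN_BATCH_SIZE = 8   # from the hyperparameter list
NUM_EPOCHS = 50        # from the hyperparameter list


def steps_per_epoch(logged):
    """Estimate optimizer steps per epoch by averaging step / epoch."""
    ratios = [step / epoch for epoch, step in logged]
    return round(sum(ratios) / len(ratios))


def example_count_bounds(steps, batch_size):
    """Range of dataset sizes that yield `steps` batches per epoch.

    Assumption: one device, no gradient accumulation, and the last
    partial batch is kept rather than dropped.
    """
    return (steps - 1) * batch_size + 1, steps * batch_size


spe = steps_per_epoch(LOGGED)
low, high = example_count_bounds(spe, TRAIN_BATCH_SIZE)
total_steps = spe * NUM_EPOCHS

print(f"steps per epoch: about {spe}")
print(f"training examples: between {low} and {high}")
print(f"total steps over {NUM_EPOCHS} epochs: about {total_steps}")
```

Under those assumptions the table works out to about 176 steps per epoch, so roughly 1,401 to 1,408 training examples and about 8,800 optimizer steps over the full run.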

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b72dfdd6d5e0fd3d90dc0a99cefaa52fe7f6d0977523f6bbd6e4a36bbdae2da5
+oid sha256:f8965bf21a35ad1bef89660da6109ce97fdbde9e5613eaa229a11b34957f7e45
 size 961895704

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fbc644fa57e244cfbb167532710a4685b8a8806f697d9e94421a73280c6a2fe8
+oid sha256:80bf136e783ae49bce88a33df057a85e1824527ffa492072a601be6e907f9095
 size 4728
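
Both binary files are stored as Git LFS pointers, so the diff only shows a new `oid` while `size` stays the same. As a rough sketch, a downloaded file could be checked against its pointer as follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and only the Python standard library is used; this is not an official Git LFS or Hugging Face API.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the pointer's size and hash."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Hash in chunks so large weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# New pointer for training_args.bin, copied from the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:80bf136e783ae49bce88a33df057a85e1824527ffa492072a601be6e907f9095
size 4728
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Calling `matches_pointer("training_args.bin", pointer)` on a local copy (the path here is an assumption) would indicate whether that copy is the post-commit version. Because the size is unchanged in both files, only the hash distinguishes the old and new revisions.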