Model save

Browse files

Files changed (3) hide show

README.md +16 -16
model-00001-of-00002.safetensors +1 -1
runs/May26_17-49-30_ae63705f58eb/events.out.tfevents.1716745774.ae63705f58eb.59466.0 +2 -2

README.md CHANGED Viewed

@@ -13,12 +13,12 @@ model-index:
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/statking/huggingface/runs/5b5skvtb)
 # paligemma-vqa
 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the vq_av2 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.9226
 ## Model description
@@ -37,7 +37,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 8e-06
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -52,19 +52,19 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 20.2791       | 0.0736 | 500  | 19.3371         |
-| 5.4004        | 0.1472 | 1000 | 4.8792          |
-| 1.5853        | 0.2207 | 1500 | 1.4809          |
-| 1.091         | 0.2943 | 2000 | 1.0661          |
-| 0.9667        | 0.3679 | 2500 | 0.9655          |
-| 0.9449        | 0.4415 | 3000 | 0.9356          |
-| 0.9241        | 0.5151 | 3500 | 0.9270          |
-| 0.9295        | 0.5886 | 4000 | 0.9238          |
-| 0.922         | 0.6622 | 4500 | 0.9228          |
-| 0.9103        | 0.7358 | 5000 | 0.9229          |
-| 0.9225        | 0.8094 | 5500 | 0.9225          |
-| 0.9159        | 0.8830 | 6000 | 0.9223          |
-| 0.934         | 0.9566 | 6500 | 0.9226          |
 ### Framework versions

 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/statking/huggingface/runs/xgb0dent)
 # paligemma-vqa
 This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the vq_av2 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0001
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.02
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 0.0019        | 0.0736 | 500  | 0.0081          |
+| 0.0004        | 0.1472 | 1000 | 0.0002          |
+| 0.0003        | 0.2207 | 1500 | 0.0002          |
+| 0.0001        | 0.2943 | 2000 | 0.0001          |
+| 0.0001        | 0.3679 | 2500 | 0.0001          |
+| 0.0001        | 0.4415 | 3000 | 0.0001          |
+| 0.0002        | 0.5151 | 3500 | 0.0002          |
+| 0.0001        | 0.5886 | 4000 | 0.0001          |
+| 0.0001        | 0.6622 | 4500 | 0.0001          |
+| 0.0001        | 0.7358 | 5000 | 0.0001          |
+| 0.0001        | 0.8094 | 5500 | 0.0001          |
+| 0.0001        | 0.8830 | 6000 | 0.0001          |
+| 0.0001        | 0.9566 | 6500 | 0.0001          |
 ### Framework versions

model-00001-of-00002.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f37fc24dcff6177b9fe739f3b9f243cec8fa0270f95b9f1a680dd840f99ef44a
 size 4985044392

 version https://git-lfs.github.com/spec/v1
+oid sha256:ef4a309cb1f2d2924746fbed6a1a29acdaa0f12a71402c25f9717a2526f97439
 size 4985044392

runs/May26_17-49-30_ae63705f58eb/events.out.tfevents.1716745774.ae63705f58eb.59466.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:54c6a64217a4207d0c77f5dafcfdc5e58ead85c6f7f3c81f2246e40d50e52afb
-size 21042

 version https://git-lfs.github.com/spec/v1
+oid sha256:ce3a5e9a4a8a6869f47b2a33a1a2ba26c7de83603763985c669f5a67faffca53
+size 23144