statking commited on
Commit
851e4e8
1 Parent(s): 68b4c17

Model save

Browse files
README.md CHANGED
@@ -13,12 +13,12 @@ model-index:
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/statking/huggingface/runs/5b5skvtb)
17
  # paligemma-vqa
18
 
19
  This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the vq_av2 dataset.
20
  It achieves the following results on the evaluation set:
21
- - Loss: 0.9226
22
 
23
  ## Model description
24
 
@@ -37,7 +37,7 @@ More information needed
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
- - learning_rate: 8e-06
41
  - train_batch_size: 16
42
  - eval_batch_size: 16
43
  - seed: 42
@@ -52,19 +52,19 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
- | 20.2791 | 0.0736 | 500 | 19.3371 |
56
- | 5.4004 | 0.1472 | 1000 | 4.8792 |
57
- | 1.5853 | 0.2207 | 1500 | 1.4809 |
58
- | 1.091 | 0.2943 | 2000 | 1.0661 |
59
- | 0.9667 | 0.3679 | 2500 | 0.9655 |
60
- | 0.9449 | 0.4415 | 3000 | 0.9356 |
61
- | 0.9241 | 0.5151 | 3500 | 0.9270 |
62
- | 0.9295 | 0.5886 | 4000 | 0.9238 |
63
- | 0.922 | 0.6622 | 4500 | 0.9228 |
64
- | 0.9103 | 0.7358 | 5000 | 0.9229 |
65
- | 0.9225 | 0.8094 | 5500 | 0.9225 |
66
- | 0.9159 | 0.8830 | 6000 | 0.9223 |
67
- | 0.934 | 0.9566 | 6500 | 0.9226 |
68
 
69
 
70
  ### Framework versions
 
13
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
14
  should probably proofread and complete it, then remove this comment. -->
15
 
16
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/statking/huggingface/runs/xgb0dent)
17
  # paligemma-vqa
18
 
19
  This model is a fine-tuned version of [google/paligemma-3b-pt-224](https://huggingface.co/google/paligemma-3b-pt-224) on the vq_av2 dataset.
20
  It achieves the following results on the evaluation set:
21
+ - Loss: 0.0001
22
 
23
  ## Model description
24
 
 
37
  ### Training hyperparameters
38
 
39
  The following hyperparameters were used during training:
40
+ - learning_rate: 0.02
41
  - train_batch_size: 16
42
  - eval_batch_size: 16
43
  - seed: 42
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss |
54
  |:-------------:|:------:|:----:|:---------------:|
55
+ | 0.0019 | 0.0736 | 500 | 0.0081 |
56
+ | 0.0004 | 0.1472 | 1000 | 0.0002 |
57
+ | 0.0003 | 0.2207 | 1500 | 0.0002 |
58
+ | 0.0001 | 0.2943 | 2000 | 0.0001 |
59
+ | 0.0001 | 0.3679 | 2500 | 0.0001 |
60
+ | 0.0001 | 0.4415 | 3000 | 0.0001 |
61
+ | 0.0002 | 0.5151 | 3500 | 0.0002 |
62
+ | 0.0001 | 0.5886 | 4000 | 0.0001 |
63
+ | 0.0001 | 0.6622 | 4500 | 0.0001 |
64
+ | 0.0001 | 0.7358 | 5000 | 0.0001 |
65
+ | 0.0001 | 0.8094 | 5500 | 0.0001 |
66
+ | 0.0001 | 0.8830 | 6000 | 0.0001 |
67
+ | 0.0001 | 0.9566 | 6500 | 0.0001 |
68
 
69
 
70
  ### Framework versions
model-00001-of-00002.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f37fc24dcff6177b9fe739f3b9f243cec8fa0270f95b9f1a680dd840f99ef44a
3
  size 4985044392
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef4a309cb1f2d2924746fbed6a1a29acdaa0f12a71402c25f9717a2526f97439
3
  size 4985044392
runs/May26_17-49-30_ae63705f58eb/events.out.tfevents.1716745774.ae63705f58eb.59466.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:54c6a64217a4207d0c77f5dafcfdc5e58ead85c6f7f3c81f2246e40d50e52afb
3
- size 21042
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce3a5e9a4a8a6869f47b2a33a1a2ba26c7de83603763985c669f5a67faffca53
3
+ size 23144