End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -1,15 +1,13 @@
 ---
-base_model: vidore/colpaligemma-3b-pt-448-base
-library_name: peft
 license: gemma
 tags:
 - generated_from_trainer
-- ColPali
 model-index:
 - name: finetune_colpali_v1_2-ufo-4bit
   results: []
-datasets:
-- davanstrien/ufo-ColPali
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -17,9 +15,10 @@ should probably proofread and complete it, then remove this comment. -->
 # finetune_colpali_v1_2-ufo-4bit
-This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the [davanstrien/ufo-ColPali](https://huggingface.co/datasets/davanstrien/ufo-ColPali) dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1368
 ## Model description
@@ -46,22 +45,22 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 50
-- num_epochs: 2
 ### Training results
-| Training Loss | Epoch  | Step | Validation Loss |
-|:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.0041 | 1    | 0.7493          |
-| 0.2096        | 0.8180 | 200  | 0.1816          |
-| 0.0825        | 1.6360 | 400  | 0.1397          |
 ### Framework versions
-- PEFT 0.11.1
 - Transformers 4.44.2
 - Pytorch 2.4.1+cu121
 - Datasets 3.0.0
-- Tokenizers 0.19.1

 ---
+library_name: transformers
 license: gemma
+base_model: vidore/colpaligemma-3b-pt-448-base
 tags:
+- colpali
 - generated_from_trainer
 model-index:
 - name: finetune_colpali_v1_2-ufo-4bit
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 # finetune_colpali_v1_2-ufo-4bit
+This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.1064
+- Model Preparation Time: 0.0056
 ## Model description
 - total_train_batch_size: 8
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 100
+- num_epochs: 1.5
 ### Training results
+| Training Loss | Epoch  | Step | Validation Loss | Model Preparation Time |
+|:-------------:|:------:|:----:|:---------------:|:----------------------:|
+| No log        | 0.0041 | 1    | 0.1879          | 0.0056                 |
+| 0.1193        | 0.4090 | 100  | 0.1136          | 0.0056                 |
+| 0.1287        | 0.8180 | 200  | 0.1122          | 0.0056                 |
+| 0.0662        | 1.2270 | 300  | 0.1063          | 0.0056                 |
 ### Framework versions
 - Transformers 4.44.2
 - Pytorch 2.4.1+cu121
 - Datasets 3.0.0
+- Tokenizers 0.19.1

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f26920f3db772a13dae7d14145db48c03045facbfad7ebe91741a55895e0db6b
 size 157071680

 version https://git-lfs.github.com/spec/v1
+oid sha256:4628eb8863cc29a8454b84ba05230947cddc40973f73f0147c3ff550f1655c97
 size 157071680

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d07db1980529ecc4c2ed47934017ff6d8f82ad485fa95b5f9ca4af0c2673f82d
-size 5176

 version https://git-lfs.github.com/spec/v1
+oid sha256:f6dc70ef2d489a8733ed6dce697261494ae1f7fb1afec04738b4fa1020cdc095
+size 5240