gimarchetti commited on
Commit
2b8d399
1 Parent(s): 9f8efc8

End of training

Browse files
Files changed (3) hide show
  1. README.md +14 -11
  2. adapter_model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 0.6701
19
 
20
  ## Model description
21
 
@@ -34,7 +34,7 @@ More information needed
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
- - learning_rate: 5e-05
38
  - train_batch_size: 2
39
  - eval_batch_size: 2
40
  - seed: 42
@@ -48,15 +48,18 @@ The following hyperparameters were used during training:
48
 
49
  ### Training results
50
 
51
- | Training Loss | Epoch | Step | Validation Loss |
52
- |:-------------:|:------:|:----:|:---------------:|
53
- | 1.5733 | 0.2632 | 50 | 0.7737 |
54
- | 0.7812 | 0.5263 | 100 | 0.7129 |
55
- | 0.7542 | 0.7895 | 150 | 0.6916 |
56
- | 0.7098 | 1.0526 | 200 | 0.6835 |
57
- | 0.6144 | 1.3158 | 250 | 0.6822 |
58
- | 0.6208 | 1.5789 | 300 | 0.6754 |
59
- | 0.6054 | 1.8421 | 350 | 0.6701 |
 
 
 
60
 
61
 
62
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 0.6739
19
 
20
  ## Model description
21
 
 
34
  ### Training hyperparameters
35
 
36
  The following hyperparameters were used during training:
37
+ - learning_rate: 0.0001
38
  - train_batch_size: 2
39
  - eval_batch_size: 2
40
  - seed: 42
 
48
 
49
  ### Training results
50
 
51
+ | Training Loss | Epoch | Step | Validation Loss |
52
+ |:-------------:|:-----:|:----:|:---------------:|
53
+ | 1.5316 | 0.2 | 38 | 0.7854 |
54
+ | 0.7931 | 0.4 | 76 | 0.7384 |
55
+ | 0.8019 | 0.6 | 114 | 0.7167 |
56
+ | 0.7487 | 0.8 | 152 | 0.6992 |
57
+ | 0.7416 | 1.0 | 190 | 0.6887 |
58
+ | 0.5919 | 1.2 | 228 | 0.6977 |
59
+ | 0.5819 | 1.4 | 266 | 0.6903 |
60
+ | 0.5948 | 1.6 | 304 | 0.6849 |
61
+ | 0.5858 | 1.8 | 342 | 0.6780 |
62
+ | 0.5539 | 2.0 | 380 | 0.6739 |
63
 
64
 
65
  ### Framework versions
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ddaa2b63c2de37c51d8b5f0be9eb57c4d814624f3df53f9d4114f09d356768d4
3
  size 93378688
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5c4430b955fd1bcca51748433e8bd18ac08a1a0b1a446165741e1c9b3e91b671
3
  size 93378688
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5603dac67b54b37a55b6aa4f9c7e84a2c6fd0b7b90c1c181a01a35e8a8029e0f
3
  size 4731
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1920691e93e1b989f360a2f842bdbbf92dc0386365f49826cf716a94f45d7f64
3
  size 4731