DongfuJiang commited on
Commit
9fe2611
1 Parent(s): de1df2a

Model save

Browse files
README.md CHANGED
@@ -11,7 +11,7 @@ model-index:
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
- [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/dongfu/Mantis/runs/canpdqed)
15
  # mantis-8b-idefics2-video-eval-50k_4096
16
 
17
  This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
@@ -33,7 +33,7 @@ More information needed
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
36
- - learning_rate: 1e-05
37
  - train_batch_size: 1
38
  - eval_batch_size: 1
39
  - seed: 42
@@ -45,7 +45,7 @@ The following hyperparameters were used during training:
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: cosine
47
  - lr_scheduler_warmup_ratio: 0.03
48
- - num_epochs: 2.0
49
 
50
  ### Training results
51
 
 
11
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
12
  should probably proofread and complete it, then remove this comment. -->
13
 
14
+ [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/dongfu/Mantis/runs/0ssto7ph)
15
  # mantis-8b-idefics2-video-eval-50k_4096
16
 
17
  This model is a fine-tuned version of [HuggingFaceM4/idefics2-8b](https://huggingface.co/HuggingFaceM4/idefics2-8b) on an unknown dataset.
 
33
  ### Training hyperparameters
34
 
35
  The following hyperparameters were used during training:
36
+ - learning_rate: 5e-06
37
  - train_batch_size: 1
38
  - eval_batch_size: 1
39
  - seed: 42
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: cosine
47
  - lr_scheduler_warmup_ratio: 0.03
48
+ - num_epochs: 1.0
49
 
50
  ### Training results
51
 
model-00001-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:230d362270462a21636821044b53a2747e5cc06b672a109f1624d0cd012a6fdf
3
  size 4966706832
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:155c09cc3348a1627ac8623d962f545cd22600c9d75905738b815257f0aed8c2
3
  size 4966706832
model-00002-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:42919a5d75a03e32e613e8333960dade0f71f5d4f01d935b974cbfb3477652c0
3
  size 4915917232
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:610a4d2ab9bd94b8ebfcc1ebfa741ca178b0bc3227ca77731e1a79acd71fe3bf
3
  size 4915917232
model-00003-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ca0bacb4d8f00c9e38d0d60127f7b56a642b3bb66073ca897ef4943276bccb59
3
  size 4999820504
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ce8c2fd31662d85be030d0eb02d09a6b322dc051487564978f3f9fa05f9436a7
3
  size 4999820504
model-00004-of-00004.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:20c240e67bfb92c45620b4ad0ec5ff9fbd4528efc527712224f912617ae2bbe5
3
  size 1923190976
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b50bc4f69217d00b81d4aa3345c1ba5f70083f07f935062dd5829a3c58a44347
3
  size 1923190976