File size: 382 Bytes
a6ac944
 
 
d47a54c
 
 
 
 
 
581f172
7891887
d47a54c
 
 
7891887
581f172
7891887
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
---
license: apache-2.0
---
**Base Model**: BLIP2-t5 pretrained version

**Finetune data**: LLAVA 150k (sample one pair of instruction-answer if multi-round conversations)

**Hyper-parameters**: 

v0:
* lr = 2e-5 --> 0.0 with cosine lr scheduler
* gbs = 32
* image size = 480
* weight decay = 0.05

v1 (same as LLAVA):
* lr = 2e-5
* gbs = 32
* image size = 480
* weight decay = 0.0