Visual Question Answering
English
nms05 commited on
Commit
3d9ee9d
1 Parent(s): 8bb8a28

Delete README.md

Browse files
Files changed (1) hide show
  1. README.md +0 -9
README.md DELETED
@@ -1,9 +0,0 @@
1
- # DinoV2-SigLIP-Phi3(LoRA)
2
-
3
- ## Model and Dataset Details
4
-
5
- * **Vision Encoder** - DinoV2 + SigLIP @384px resolution.
6
- * **Connector** - MLP (Dino and SigLIP features are concatenated and then projected to Phi3 representation space)
7
- * **Language Model** - Phi3 + LoRA
8
- * **Pre-train (Align) Dataset** - LLaVA-CC3M-Pretrain-595K
9
- * **Fine-tune (Instruction) Dataset** - LLAVA-v1.5-Instruct + LRV-Instruct