nms05
/

Dinov2-SigLIP-Phi3-LoRA

Visual Question Answering

Model card Files Files and versions Community

nms05 commited on May 23

Commit

3d9ee9d

•

1 Parent(s): 8bb8a28

Delete README.md

Files changed (1) hide show

README.md +0 -9

README.md DELETED Viewed

@@ -1,9 +0,0 @@
-# DinoV2-SigLIP-Phi3(LoRA)
-## Model and Dataset Details
-* **Vision Encoder** - DinoV2 + SigLIP @384px resolution.
-* **Connector** - MLP (Dino and SigLIP features are concatenated and then projected to Phi3 representation space)
-* **Language Model** - Phi3 + LoRA
-* **Pre-train (Align) Dataset** - LLaVA-CC3M-Pretrain-595K
-* **Fine-tune (Instruction) Dataset** - LLAVA-v1.5-Instruct + LRV-Instruct