Delete README.md
Browse files
README.md
DELETED
@@ -1,9 +0,0 @@
|
|
1 |
-
# DinoV2-SigLIP-Phi3(LoRA)
|
2 |
-
|
3 |
-
## Model and Dataset Details
|
4 |
-
|
5 |
-
* **Vision Encoder** - DinoV2 + SigLIP @384px resolution.
|
6 |
-
* **Connector** - MLP (Dino and SigLIP features are concatenated and then projected to Phi3 representation space)
|
7 |
-
* **Language Model** - Phi3 + LoRA
|
8 |
-
* **Pre-train (Align) Dataset** - LLaVA-CC3M-Pretrain-595K
|
9 |
-
* **Fine-tune (Instruction) Dataset** - LLAVA-v1.5-Instruct + LRV-Instruct
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|