weizhiwang
commited on
Commit
•
4b62e65
1
Parent(s):
bbb4d08
Create README.md
Browse files
README.md
ADDED
@@ -0,0 +1,24 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
license: cc
|
3 |
+
datasets:
|
4 |
+
- liuhaotian/LLaVA-Instruct-150K
|
5 |
+
- liuhaotian/LLaVA-Pretrain
|
6 |
+
language:
|
7 |
+
- en
|
8 |
+
---
|
9 |
+
|
10 |
+
# Model Card for LLaVA-LLaMA-3-8B
|
11 |
+
|
12 |
+
<!-- Provide a quick summary of what the model is/does. -->
|
13 |
+
|
14 |
+
A reproduced LLaVA LVLM based on Llama-3-8B LLM backbone. Not an official implementation.
|
15 |
+
|
16 |
+
## Model Details
|
17 |
+
Follows LLavA-1.5 pre-train and supervised fine-tuning data.
|
18 |
+
|
19 |
+
## Uses
|
20 |
+
|
21 |
+
<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
|
22 |
+
|
23 |
+
Please refer to a forked [LLaVA-Llama-3](https://github.com/Victorwz/LLaVA-Llama-3) git repo for usage. The data loading function and fastchat conversation template are changed due to a different tokenizer.
|
24 |
+
|