weizhiwang
/

LLaVA-Llama-3-8B

Text Generation

Inference Endpoints

Model card Files Files and versions Community

weizhiwang commited on Apr 20

Commit

4b62e65

•

1 Parent(s): bbb4d08

Create README.md

Files changed (1) hide show

README.md +24 -0

README.md ADDED Viewed

	@@ -0,0 +1,24 @@

+---
+license: cc
+datasets:
+- liuhaotian/LLaVA-Instruct-150K
+- liuhaotian/LLaVA-Pretrain
+language:
+- en
+---
+# Model Card for LLaVA-LLaMA-3-8B
+<!-- Provide a quick summary of what the model is/does. -->
+A reproduced LLaVA LVLM based on Llama-3-8B LLM backbone. Not an official implementation.
+## Model Details
+Follows LLavA-1.5 pre-train and supervised fine-tuning data.
+## Uses
+<!-- Address questions around how the model is intended to be used, including the foreseeable users of the model and those affected by the model. -->
+Please refer to a forked [LLaVA-Llama-3](https://github.com/Victorwz/LLaVA-Llama-3) git repo for usage. The data loading function and fastchat conversation template are changed due to a different tokenizer.