cjpais commited on
Commit
ec00f7e
1 Parent(s): ff0ea29

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +63 -1
README.md CHANGED
@@ -1,7 +1,69 @@
1
  ---
 
2
  tags:
3
  - llava
4
  pipeline_tag: image-text-to-text
5
  ---
6
 
7
- # Model Card Title
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
+ license: apache-2.0
3
  tags:
4
  - llava
5
  pipeline_tag: image-text-to-text
6
  ---
7
 
8
+ # GGUF Quantized LLaVA 1.6 Vicuna 13B
9
+
10
+ Updated quants and projector from [PR #5267](https://github.com/ggerganov/llama.cpp/pull/5267)
11
+
12
+ | Name | Quant method | Bits | Size | Use case |
13
+ | ---- | ---- | ---- | ---- | ----- |
14
+ | [llava-v1.6-vicuna-13b.Q3_K_XS.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q3_K_XS.gguf) | Q3_K_XS | 3 | 5.31 GB| very small, high quality loss |
15
+ | [llava-v1.6-vicuna-13b.Q3_K_M.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q3_K_M.gguf) | Q3_K_M | 3 | 6.34 GB| very small, high quality loss |
16
+ | [llava-v1.6-vicuna-13b.Q4_K_M.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q4_K_M.gguf) | Q4_K_M | 4 | 7.87 GB| medium, balanced quality - recommended |
17
+ | [llava-v1.6-vicuna-13b.Q5_K_S.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q5_K_S.gguf) | Q5_K_S | 5 | 8.97 GB| large, low quality loss - recommended |
18
+ | [llava-v1.6-vicuna-13b.Q5_K_M.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q5_K_M.gguf) | Q5_K_M | 5 | 9.23 GB| large, very low quality loss - recommended |
19
+ | [llava-v1.6-vicuna-13b.Q6_K.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q6_K.gguf) | Q6_K | 5 | 10.7 GB| very large, extremely low quality loss |
20
+ | [llava-v1.6-vicuna-13b.Q8_0.gguf](https://huggingface.co/cjpais/llava-v1.6-vicuna-13b-gguf/blob/main/llava-v1.6-vicuna-13b.Q8_0.gguf) | Q8_0 | 5 | 13.8 GB| very large, extremely low quality loss - not recommended |
21
+
22
+ <br>
23
+ <br>
24
+
25
+ ---
26
+ inference: false
27
+ ---
28
+
29
+ <br>
30
+ <br>
31
+
32
+ # ORIGINAL LLaVA Model Card
33
+
34
+ ## Model details
35
+
36
+ **Model type:**
37
+ LLaVA is an open-source chatbot trained by fine-tuning LLM on multimodal instruction-following data.
38
+ It is an auto-regressive language model, based on the transformer architecture.
39
+ Base LLM: [lmsys/vicuna-13b-v1.5](https://huggingface.co/lmsys/vicuna-13b-v1.5)
40
+
41
+ **Model date:**
42
+ LLaVA-v1.6-Vicuna-13B was trained in December 2023.
43
+
44
+ **Paper or resources for more information:**
45
+ https://llava-vl.github.io/
46
+
47
+ ## License
48
+ Llama 2 is licensed under the LLAMA 2 Community License,
49
+ Copyright (c) Meta Platforms, Inc. All Rights Reserved.
50
+
51
+ **Where to send questions or comments about the model:**
52
+ https://github.com/haotian-liu/LLaVA/issues
53
+
54
+ ## Intended use
55
+ **Primary intended uses:**
56
+ The primary use of LLaVA is research on large multimodal models and chatbots.
57
+
58
+ **Primary intended users:**
59
+ The primary intended users of the model are researchers and hobbyists in computer vision, natural language processing, machine learning, and artificial intelligence.
60
+
61
+ ## Training dataset
62
+ - 558K filtered image-text pairs from LAION/CC/SBU, captioned by BLIP.
63
+ - 158K GPT-generated multimodal instruction-following data.
64
+ - 500K academic-task-oriented VQA data mixture.
65
+ - 50K GPT-4V data mixture.
66
+ - 40K ShareGPT data.
67
+
68
+ ## Evaluation dataset
69
+ A collection of 12 benchmarks, including 5 academic VQA benchmarks and 7 recent benchmarks specifically proposed for instruction-following LMMs.