This is a high-quality quantization of Yi-VL-6B and of its visual transformer.

Q5_K is almost equal to fp16 in inference quality, and Q6_K is essentially indistinguishable from it (this has not been tested as reliably for visual transformers, but I assume it matches what holds for language models).

You currently need to apply this PR to make it work: https://github.com/ggerganov/llama.cpp/pull/5093 - it adds the additional normalization steps to the projection.

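For reference, here is a minimal Python sketch of how the quantized language model and the separate visual-transformer (mmproj) file could be loaded through llama-cpp-python. This is an illustration, not a tested recipe: it assumes your llama-cpp-python build uses a llama.cpp that already includes the PR above, the file names are placeholders for the GGUF files in this repo, and the LLaVA-1.5 chat handler may not reproduce Yi-VL's exact prompt format.

```python
# Sketch only: assumes llama-cpp-python is built against a llama.cpp checkout
# that includes PR 5093, and that the file names below (placeholders) match
# the GGUF files you downloaded from this repo.
from llama_cpp import Llama
from llama_cpp.llama_chat_format import Llava15ChatHandler

# The visual transformer (mmproj) is loaded separately from the language model.
chat_handler = Llava15ChatHandler(clip_model_path="yi-vl-6b-mmproj-Q6_K.gguf")

llm = Llama(
    model_path="yi-vl-6b-Q5_K.gguf",
    chat_handler=chat_handler,
    n_ctx=4096,       # leave room for the image embeddings
    logits_all=True,  # required by the llava-style chat handlers
)

result = llm.create_chat_completion(
    messages=[
        {
            "role": "user",
            "content": [
                {"type": "image_url", "image_url": {"url": "file:///path/to/image.png"}},
                {"type": "text", "text": "Describe this image."},
            ],
        }
    ],
    temperature=0.1,
)
print(result["choices"][0]["message"]["content"])
```
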
I do not like this model; it hallucinates more than anything else based on llava.