abetlen/paligemma-3b-mix-224-gguf

abetlen committed on Oct 3

Commit 17f3ac9 • 1 parent: b3ab4c2

Update README.md

Files changed (1)

README.md +1 -0

@@ -25,6 +25,7 @@ llm = Llama.from_pretrained(
   repo_id="abetlen/paligemma-3b-mix-224-gguf",
   filename="*text-model-q4_k_m.gguf",
   chat_handler=chat_handler,
+  n_gpu_layers=-1,
   n_ctx=2048, # n_ctx should be increased to accommodate the image embedding
   n_ubatch=512, # must be large enough to fit image embeddings and text input in a single batch
   n_batch=512
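
The comments in this hunk describe two sizing constraints: the context has to hold the image embedding, and one micro-batch has to hold the image embedding plus the text input. A minimal sketch of checking those constraints before loading the model is below. The helper `paligemma_llama_kwargs` and the constant `IMAGE_TOKENS_224` are hypothetical names, not part of llama-cpp-python, and the figure of 256 image tokens assumes the 224-pixel checkpoint with 14-pixel patches.

```python
# Hypothetical helper, not part of llama-cpp-python.
# Assumption: a 224x224 image with 14x14 patches yields (224 // 14) ** 2 = 256 image tokens.
IMAGE_TOKENS_224 = (224 // 14) ** 2


def paligemma_llama_kwargs(
    max_text_tokens: int,
    n_ctx: int = 2048,
    n_ubatch: int = 512,
    n_batch: int = 512,
    n_gpu_layers: int = -1,  # -1 asks llama-cpp-python to offload all layers to the GPU
) -> dict:
    """Return keyword arguments for the loader after checking the sizing rules."""
    needed = IMAGE_TOKENS_224 + max_text_tokens
    # Image embeddings and text input must fit in a single micro-batch.
    if needed > n_ubatch:
        raise ValueError(f"n_ubatch={n_ubatch} is too small for {needed} tokens")
    # The context window must also hold the image embedding plus the text.
    if needed > n_ctx:
        raise ValueError(f"n_ctx={n_ctx} is too small for {needed} tokens")
    return {
        "n_gpu_layers": n_gpu_layers,
        "n_ctx": n_ctx,
        "n_ubatch": n_ubatch,
        "n_batch": n_batch,
    }


if __name__ == "__main__":
    print(paligemma_llama_kwargs(max_text_tokens=64))
```

The returned dictionary could then be unpacked into the call shown in the diff, alongside the `repo_id`, `filename`, and `chat_handler` arguments already present in the README.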