nullt3r/Meta-Llama-3-8B-Instruct-64k-PoSE-Q8_0-GGUF

nullt3r committed on Apr 25

Commit ca2ca0f, 1 parent: 8e77f37

Update README.md

Files changed (1)

README.md +1 -1

@@ -13,7 +13,7 @@ pipeline_tag: text-generation
 ---
 # nullt3r/Meta-Llama-3-8B-Instruct-64k-PoSE-Q8_0-GGUF
-**The model performs well when used with LM Studio and the standard LLaMA 3 profile. However, I've found there is an issue in ollama, where it generates tokens continuously and never stops.**
+**This is 64K context size model and performs well when used with LM Studio and the standard LLaMA 3 profile. However, I've found there is an issue in ollama, where it generates tokens continuously and never stops.**
 This model was converted to GGUF format from [`Azma-AI/Meta-Llama-3-8B-Instruct-64k-PoSE`](https://huggingface.co/Azma-AI/Meta-Llama-3-8B-Instruct-64k-PoSE) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
 Refer to the [original model card](https://huggingface.co/Azma-AI/Meta-Llama-3-8B-Instruct-64k-PoSE) for more details on the model.