Update README.md
README.md CHANGED
@@ -23,7 +23,9 @@ pipeline_tag: text-generation
 
 ## Llamacpp iMatrix Quantizations of dolphin-2.9-llama3-8b
 
-
+This model has the <|eot_id|> token set to not-special, which seems to work better with current inference engines.
+
+Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> fork from pcuenca <a href="https://github.com/pcuenca/llama.cpp/tree/llama3-conversion">llama3-conversion</a> for quantization.
 
 Original model: https://huggingface.co/cognitivecomputations/dolphin-2.9-llama3-8b
 
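The "set to not-special" change in the diff above refers to the per-token `special` flag in the tokenizer configuration. A minimal sketch of what that edit looks like, assuming the usual `added_tokens_decoder` layout of a `tokenizer_config.json` (the token id `128009` for `<|eot_id|>` is the standard Llama 3 mapping, shown here as an illustrative assumption, not taken from this commit):

```python
import json

# Hypothetical excerpt of a tokenizer_config.json before the change:
# each added token carries a "special" flag.
config = {
    "added_tokens_decoder": {
        "128009": {"content": "<|eot_id|>", "special": True},
    }
}

def set_token_not_special(cfg, token_text):
    """Flip the 'special' flag to False for every added token
    whose text matches token_text."""
    for entry in cfg["added_tokens_decoder"].values():
        if entry["content"] == token_text:
            entry["special"] = False
    return cfg

cfg = set_token_not_special(config, "<|eot_id|>")
print(json.dumps(cfg["added_tokens_decoder"]["128009"]))
```

With the flag cleared, inference engines that skip special tokens when detokenizing will emit `<|eot_id|>` as ordinary text, which is the behavior the commit message says works better with current engines.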