## Llamacpp iMatrix Quantizations of dolphin-2.9-llama3-8b
## This model has been deprecated in favour of the requanted version with tokenizer fixes here: https://huggingface.co/bartowski/dolphin-2.9-llama3-8b-GGUF
This model has the <|eot_id|> token set to not-special, which seems to work better with current inference engines.
Using the <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> fork <a href="https://github.com/pcuenca/llama.cpp/tree/llama3-conversion">llama3-conversion</a> from pcuenca for quantization.