bartowski commited on
Commit
3010898
1 Parent(s): d3f6fad

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -13,7 +13,9 @@ pipeline_tag: text-generation
13
 
14
  ## Llamacpp iMatrix Quantizations of Llama-3-Orca-1.0-8B
15
 
16
- Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/experimental">experimental</a> for quantization.
 
 
17
 
18
  Original model: https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B
19
 
 
13
 
14
  ## Llamacpp iMatrix Quantizations of Llama-3-Orca-1.0-8B
15
 
16
+ This model has the <|eot_id|> token set to not-special, which seems to work better with current inference engines.
17
+
18
+ Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> fork from pcuenca <a href="https://github.com/pcuenca/llama.cpp/tree/llama3-conversion">llama3-conversion</a> for quantization.
19
 
20
  Original model: https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B
21