Update README.md
README.md
CHANGED
@@ -13,7 +13,9 @@ pipeline_tag: text-generation
 
 ## Llamacpp iMatrix Quantizations of Llama-3-Orca-1.0-8B
 
-
+This model has the <|eot_id|> token set to not-special, which seems to work better with current inference engines.
+
+Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> fork from pcuenca <a href="https://github.com/pcuenca/llama.cpp/tree/llama3-conversion">llama3-conversion</a> for quantization.
 
 Original model: https://huggingface.co/Locutusque/Llama-3-Orca-1.0-8B
 