mradermacher
commited on
Commit
•
459eecb
1
Parent(s):
40d6649
Update README.md
Browse files
README.md
CHANGED
@@ -17,6 +17,7 @@ tags:
|
|
17 |
<!-- ### vocab_type: -->
|
18 |
weighted/imatrix quants of https://huggingface.co/elinas/Llama-3-8B-Ultra-Instruct
|
19 |
|
|
|
20 |
|
21 |
<!-- provided-files -->
|
22 |
static quants are available at https://huggingface.co/mradermacher/Llama-3-8B-Ultra-Instruct-GGUF
|
|
|
17 |
<!-- ### vocab_type: -->
|
18 |
weighted/imatrix quants of https://huggingface.co/elinas/Llama-3-8B-Ultra-Instruct
|
19 |
|
20 |
+
You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants. (see https://old.reddit.com/r/LocalLLaMA/comments/1cg0z1i/bpe_pretokenization_support_is_now_merged_llamacpp/?share_id=5dBFB9x0cOJi8vbr-Murh)
|
21 |
|
22 |
<!-- provided-files -->
|
23 |
static quants are available at https://huggingface.co/mradermacher/Llama-3-8B-Ultra-Instruct-GGUF
|