mradermacher committed on
Commit
459eecb
1 Parent(s): 40d6649

Update README.md

Files changed (1)
  1. README.md +1 -0
README.md CHANGED
@@ -17,6 +17,7 @@ tags:
 <!-- ### vocab_type: -->
 weighted/imatrix quants of https://huggingface.co/elinas/Llama-3-8B-Ultra-Instruct
 
+You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants. (see https://old.reddit.com/r/LocalLLaMA/comments/1cg0z1i/bpe_pretokenization_support_is_now_merged_llamacpp/?share_id=5dBFB9x0cOJi8vbr-Murh)
 
 <!-- provided-files -->
 static quants are available at https://huggingface.co/mradermacher/Llama-3-8B-Ultra-Instruct-GGUF
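
For illustration, the override from the added README line could be passed to a llama.cpp command-line run roughly as sketched below. This is a minimal sketch under assumptions, not the author's own invocation: the binary name `./main`, the model filename, the prompt, and the helper `build_llama_cmd` are all hypothetical placeholders; only the `--override-kv tokenizer.ggml.pre=str:llama3` argument comes from the commit itself.

```python
import shlex


def build_llama_cmd(model_path, prompt, binary="./main"):
    """Build an argv list for a llama.cpp CLI run with the pre-tokenizer override.

    `binary` is an assumption: the CLI executable name depends on the
    llama.cpp version and on how it was built.
    """
    return [
        binary,
        "-m", model_path,  # path to the GGUF file (hypothetical name below)
        # Key/value override taken verbatim from the README line in this commit.
        "--override-kv", "tokenizer.ggml.pre=str:llama3",
        "-p", prompt,
    ]


# Hypothetical filename and prompt, used only to show the resulting command line.
cmd = build_llama_cmd("Llama-3-8B-Ultra-Instruct.i1-Q4_K_M.gguf", "Hello")
print(shlex.join(cmd))
```

The list can be handed to `subprocess.run(cmd)` once the binary path and model file match the local setup; building the arguments as a list avoids shell-quoting problems with the `key=type:value` string.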