Quantization Q2_K

by sharonsky - opened Feb 3

Discussion

sharonsky

Feb 3

I want to quantize the model in Q2_K gguf but there is not enough merges.txt. Please add merges.txt or tokenizer.model

andreatironi

Feb 3

Where can i find gguf file?

sharonsky

Feb 6

Where can i find gguf file?

gguf file is created, but I can't add a dictionary there, because it should be a SentencePiece. To convert to SentencePiece, I need merges.txt. And without a dictionary, the gguf file is useless. I can upload it, maybe someone will finish it

andreatironi

Feb 6

In comment #5 probably there is a solution.

f-buciuni

Almawave org Feb 7

A pull request in the llama.cpp repository (https://github.com/ggerganov/llama.cpp/pull/11716) has already been submitted to address this issue and is currently under review. You can refer to the fork used for the pull request or wait for the marge to convert the model to GGUF format and to quantize it using the allowed methods (i.e. Q2_K).

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment