EOS token is getting printed

#1
by migtissera - opened

Not sure whether this is a quant error?

Screenshot 2024-06-17 at 5.38.08 PM.png

what client is this?

It's marked as not being special in your tokenizer_config.json which, from my experience, means that the engine doesn't know it's a special token that it's not meant to display

It's FreeChat, on Mac.

Ah.. Okay, I have no idea why. Is it too late to fix this once it's quantized? Like, is there a way to edit the GGUF file?

You can edit the metadata manually to fix it, I can also just remake them properly

Okay. I have edited the tokenizer_config.json here: https://huggingface.co/migtissera/Tess-v2.5-Phi-3-medium-128k-14B/blob/main/tokenizer_config.json

Can you do a small quant, like maybe an 8-bit, and then we can check it before you go ahead and make all the quants?

@migtissera uploaded fixed q8 to test

Okay, downloading now

Yes! Can confirm this is working perfectly now.

Screenshot 2024-06-18 at 1.14.07 PM.png

Please go ahead with rest of the quantizations. Thanks for your support and responsiveness!

migtissera changed discussion status to closed
This comment has been hidden

Sign up or log in to comment