MaziyarPanahi/Meta-Llama-3-70B-Instruct-GGUF · EOS token ID changed for unquantized version

#16

by Gnurro2 - opened May 10

Gnurro2

May 10

See https://huggingface.co/meta-llama/Meta-Llama-3-70B-Instruct/discussions/49/files
The EOS token oversight is now being fixed, but the GGUF files here still have the old ID. Would be great if this gets fixed.
With the old ID, the models kept generating until they hit the token limit.

MaziyarPanahi

Owner May 11 (edited May 11)

It's really not an issue if the library you are using has <|eot_id|> set as a stop string; generation then ends where it should (Ollama, LM Studio, etc.).
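For anyone wondering what a stop string does in practice, here is a minimal client-side sketch. It is only an illustration: the function name `truncate_at_stop` is made up for this example, and it is not how Ollama or LM Studio implement it. Real servers stop generating when the stop string appears, while this just trims text that has already been produced.

```python
def truncate_at_stop(text, stop_strings=("<|eot_id|>",)):
    """Return `text` cut off at the earliest occurrence of any stop string.

    Hypothetical helper for illustration; the stop string itself is dropped.
    """
    cut = len(text)
    for stop in stop_strings:
        idx = text.find(stop)
        if idx != -1:
            # Keep the earliest match across all stop strings.
            cut = min(cut, idx)
    return text[:cut]


if __name__ == "__main__":
    raw = "Sure, here you go.<|eot_id|>assistant\n\nAnd some runaway text"
    print(truncate_at_stop(raw))
```

If no stop string is found, the text comes back unchanged.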

If what you use to serve the model doesn't support terminators/stop strings, you can easily edit the GGUF metadata key tokenizer.ggml.eos_token_id to 128009 yourself.
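A rough sketch of that metadata edit in plain Python, for those who have not done it before. Assumptions, stated up front: the file is little-endian GGUF version 2 or 3, the EOS ID is stored as a uint32 under the key `tokenizer.ggml.eos_token_id` (the name llama.cpp uses), and `set_eos_token_id` is a name invented for this example. It overwrites four bytes in place, so run it on a copy. A maintained tool, such as the gguf Python package from llama.cpp, is likely the safer choice for real files.

```python
import struct

# Byte sizes of GGUF's fixed-width value types (type id -> size in bytes).
_FIXED = {0: 1, 1: 1, 2: 2, 3: 2, 4: 4, 5: 4, 6: 4, 7: 1, 10: 8, 11: 8, 12: 8}
_UINT32, _STRING, _ARRAY = 4, 8, 9


def _read(f, fmt):
    """Read one little-endian value described by a struct format string."""
    return struct.unpack(fmt, f.read(struct.calcsize(fmt)))[0]


def _skip_value(f, vtype):
    """Advance the file position past one metadata value of type `vtype`."""
    if vtype in _FIXED:
        f.seek(_FIXED[vtype], 1)
    elif vtype == _STRING:
        f.seek(_read(f, "<Q"), 1)  # uint64 length, then that many bytes
    elif vtype == _ARRAY:
        etype = _read(f, "<I")
        count = _read(f, "<Q")
        if etype in _FIXED:
            f.seek(_FIXED[etype] * count, 1)
        else:
            for _ in range(count):
                _skip_value(f, etype)
    else:
        raise ValueError(f"unknown GGUF value type {vtype}")


def set_eos_token_id(path, new_id, key="tokenizer.ggml.eos_token_id"):
    """Overwrite a uint32 metadata value in place and return the old value."""
    with open(path, "r+b") as f:
        if f.read(4) != b"GGUF":
            raise ValueError("not a GGUF file")
        if _read(f, "<I") < 2:
            raise ValueError("GGUF v1 uses a different layout")
        _read(f, "<Q")  # tensor count, not needed here
        kv_count = _read(f, "<Q")
        for _ in range(kv_count):
            name = f.read(_read(f, "<Q")).decode("utf-8")
            vtype = _read(f, "<I")
            if name == key:
                if vtype != _UINT32:
                    raise ValueError(f"{key} is not stored as uint32")
                pos = f.tell()
                old = _read(f, "<I")
                f.seek(pos)  # go back and overwrite the same four bytes
                f.write(struct.pack("<I", new_id))
                return old
            _skip_value(f, vtype)
    raise KeyError(key)
```

Calling `set_eos_token_id("model-copy.gguf", 128009)` (the file name is a placeholder) returns the ID that was stored before the change, which makes it easy to confirm that the file really had the old value.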

PS: https://huggingface.co/MaziyarPanahi/Meta-Llama-3-70B-Instruct-GGUF/discussions/7

MaziyarPanahi changed discussion status to closed May 11

Gnurro2

May 11

I'll do that, then.
Just seeing that 300k+ people downloaded these model files as the most convenient way of using Llama3, and many users do not know how to pull off that edit. Would save them the hassle. ;)

MaziyarPanahi

Owner May 12

Makes sense, I'll see if I can re-upload the edited files this weekend :)

Gnurro2

May 14

Great, thanks for the reuploads! ...I think you missed the Q8, though. ;)

MaziyarPanahi

Owner May 14

You are totally right, uploading it now :)
