Fix eos_token in tokenizer_config.json

#1

eos_token was set to "<|eot_id|>" when it should be "<|end_of_text|>".
This caused bugs like generation never ending: a base model never produces "<|eot_id|>", so the stopping check never fires.
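
For context, here is a quick way to verify which stop token a checkpoint is configured with. This is a minimal sketch; the base-model repo id is an assumption (substitute the checkpoint you actually loaded), and it relies on mlx_lm's tokenizer wrapper forwarding the underlying Hugging Face tokenizer's attributes:

```python
from mlx_lm import load

# NOTE: the base-model repo id below is an assumption; substitute the
# checkpoint you actually loaded.
_, tokenizer = load("mlx-community/Meta-Llama-3-8B-4bit")

# With the bug, this prints "<|eot_id|>"; after the fix it should print
# "<|end_of_text|>", the token a Llama 3 base model actually emits.
print(tokenizer.eos_token)
```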

MLX Community org

Instruct models don't have this issue; they are tuned to close each turn with "<|eot_id|>", so that is the correct stop token for them.

It's only the base models, which should stop on "<|end_of_text|>".
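
The chat template makes the distinction visible. A minimal sketch using the instruct checkpoint from this thread; it assumes apply_chat_template is reachable through the tokenizer mlx_lm returns:

```python
from mlx_lm import load

_, tokenizer = load("mlx-community/Meta-Llama-3-8B-Instruct-4bit")

# Render a single-turn chat as text to inspect the raw template output.
text = tokenizer.apply_chat_template(
    [{"role": "user", "content": "hello"}], tokenize=False
)
print(text)
# The rendered turn ends with "<|eot_id|>", which is why instruct models
# stop correctly on that token; a base model has no chat turns and
# should stop on "<|end_of_text|>" instead.
```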

prince-canuma changed pull request status to closed

I have the same issue.
When setting max_tokens to 1024, the model doesn't stop generating.

MLX Community org

Please give me a reproducible example :)

Hi,
Here is the code suggested in the model card, but with max_tokens=1024:

```python
from mlx_lm import load, generate

model, tokenizer = load("mlx-community/Meta-Llama-3-8B-Instruct-4bit")
response = generate(model, tokenizer, prompt="hello", verbose=True, max_tokens=1024)
```

The model does not stop generating text, as shown in this related discussion: huggingface.co/mlx-community/Meta-Llama-3-8B-Instruct-4bit/discussions/3

Hey @prince-canuma

Here's the sample code:

```python
from mlx_lm import load, generate
from markdown import markdown
from IPython.display import Markdown, display

model, tokenizer = load("mlx-community/Meta-Llama-3-8B-Instruct-4bit")

response = generate(
    model,
    tokenizer,
    prompt="What is 5 plus 5?",
    verbose=True,
    max_tokens=400,
)
```

And attached is the response it's giving me.

[Attachment: image.png — screenshot of the model's output, which keeps generating past the answer]

MLX Community org

Can you run the same command using the terminal and share the results?

```shell
python -m mlx_lm.generate --model ... --prompt "..."
```
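
With the model and prompt from the snippet above, that would look roughly like this (the --max-tokens flag is part of the mlx_lm.generate CLI; the exact invocation is an assumption pieced together from this thread):

```shell
python -m mlx_lm.generate \
  --model mlx-community/Meta-Llama-3-8B-Instruct-4bit \
  --prompt "What is 5 plus 5?" \
  --max-tokens 400
```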
prince-canuma changed pull request status to open