Tokenizer issue

#2
by apoorvumang - opened

There is something wrong with the tokenizer: it keeps generating multiple chat turns until max_tokens is reached.

I'm currently working around it by using the tokenizer from llama-3-8b (imports here are from `mlx_lm.utils`, where these helpers live as of recent mlx-lm versions):

```python
from mlx_lm.utils import get_model_path, load_tokenizer

tokenizer = load_tokenizer(get_model_path("mlx-community/Meta-Llama-3-8B-Instruct-4bit"))
```

If someone can fix the tokenizer here, please help; the model is quite good (uncensored).
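For context, here is a minimal sketch (simulated token stream, not the actual mlx-lm generation loop) of why a broken tokenizer produces this symptom: generation only stops when the sampled token matches the tokenizer's eos token id, so if the tokenizer reports the wrong id, the loop runs until max_tokens and the model keeps starting new chat turns.

```python
def generate_ids(next_token, eos_token_id, max_tokens):
    """Collect tokens until eos_token_id is seen or max_tokens is hit."""
    out = []
    for _ in range(max_tokens):
        tok = next_token()
        if tok == eos_token_id:
            break
        out.append(tok)
    return out

# Simulated model output: one short turn ending in 128009
# (<|eot_id|> in Llama 3), then a second turn that should never be emitted.
tokens = iter([10, 11, 12, 128009, 20, 21, 22, 128009])

# With the correct eos id, generation stops after the first turn:
first_turn = generate_ids(lambda: next(tokens), eos_token_id=128009, max_tokens=8)
print(first_turn)  # [10, 11, 12]

# With a wrong eos id (what a broken tokenizer config effectively does),
# generation runs all the way to max_tokens, past the turn boundary:
tokens = iter([10, 11, 12, 128009, 20, 21, 22, 128009])
runaway = generate_ids(lambda: next(tokens), eos_token_id=-1, max_tokens=8)
print(runaway)  # [10, 11, 12, 128009, 20, 21, 22, 128009]
```

So a likely fix is making this repo's tokenizer_config.json report the same eos/stop token as the upstream Llama 3 Instruct tokenizer.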
