Missing ChatML special tokens

So I modified the files but I'm getting this error because of the addition of the <|im_start|> and <|im_end|> tokens
ValueError: Parameter model.embed_tokens.q_weight has shape (32000, 512), but expected (32002, 512)

Weyaxi

Owner Jan 18

Have you changed all tokenizer related files to match chatml?

brlambert

Jan 18

Yes I believe so, but adding these new special tokens seems to be imcompatible with the model. I tried to fix this by editing the problematic layer shape with model.resize_token_embeddings(32002) and I am able to generate outputs with this new model but it's all gibberish

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment