Update tokenizer_config.json

by erichartford - opened about 18 hours ago

base: refs/heads/main ← from: refs/pr/3


erichartford

about 18 hours ago

Use the official chat template, which inserts a <think> tag

Update tokenizer_config.json 7d72a9d7

erichartford

about 18 hours ago • edited about 18 hours ago

Can you please go back to DeepSeek's official chat template, which includes <think> to force the model to start with <think>? Otherwise, it often skips thinking or mixes thinking in with the response.

https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B/blob/main/tokenizer_config.json#L34
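A quick way to see whether a given tokenizer_config.json carries the tag is to inspect its chat_template string. The sketch below is only illustrative: the function name `template_mentions_think` is hypothetical, and the two template strings are simplified stand-ins, not the real DeepSeek templates.

```python
import json

THINK_TAG = "<think>"


def template_mentions_think(tokenizer_config_text: str) -> bool:
    """Return True if the config's chat_template string contains <think>."""
    config = json.loads(tokenizer_config_text)
    template = config.get("chat_template")
    # Only a plain string template is handled in this sketch.
    if not isinstance(template, str):
        return False
    return THINK_TAG in template


# Simplified, hypothetical stand-ins for a config without and with the tag.
old_config = json.dumps(
    {"chat_template": "{% if add_generation_prompt %}{{ '<|Assistant|>' }}{% endif %}"}
)
new_config = json.dumps(
    {"chat_template": "{% if add_generation_prompt %}{{ '<|Assistant|><think>\\n' }}{% endif %}"}
)

print(template_mentions_think(old_config))
print(template_mentions_think(new_config))
```

This only checks that the tag appears somewhere in the template. It does not render the template or confirm that the tag lands at the start of the assistant turn.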

robertgshaw2

Neural Magic org about 16 hours ago

Thanks @erichartford for pointing this out. Did DeepSeek change the tokenizer? We just want to make sure we understand why it diverged.

nm-research changed pull request status to merged about 13 hours ago

nm-research

Neural Magic org about 12 hours ago

@erichartford Great catch, thanks a lot for pointing it out! Indeed, DeepSeek updated tokenizer_config.json with this diff 14 days ago. We forked their model for quantization on the initial release day, so we hadn't picked up the change. We will update all of our quantized models.
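Drift like this can be caught by comparing a fork's tokenizer_config.json against the upstream copy. The following is a minimal sketch, assuming both files are already available as JSON text. The helper `drifted_keys` and the sample configs are hypothetical and not part of any Neural Magic or DeepSeek tooling.

```python
import json


def drifted_keys(upstream_text: str, fork_text: str) -> list[str]:
    """Return the sorted top-level keys whose values differ between two configs."""
    upstream = json.loads(upstream_text)
    fork = json.loads(fork_text)
    # A key counts as drifted if it is missing on one side or its value differs.
    all_keys = set(upstream) | set(fork)
    return sorted(k for k in all_keys if upstream.get(k) != fork.get(k))


# Hypothetical sample data: the fork predates an upstream template change.
upstream_cfg = json.dumps(
    {"chat_template": "...<think>\\n...", "model_max_length": 16384}
)
fork_cfg = json.dumps(
    {"chat_template": "......", "model_max_length": 16384}
)

print(drifted_keys(upstream_cfg, fork_cfg))
```

In practice, the two files could be fetched with something like `hf_hub_download` from the `huggingface_hub` package before being compared. That step is left out here to keep the sketch offline.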
